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SEQUENCE LISTING 



<110> BASF Plant Science GmbH 
SweTree Technologies AB 



<120> IMPROVED CONSTRUCTS FOR ^aARKER EXCISION BASED ON DUAL-FUNCTION 
SELECTION MARKER 

<130> PF 55443 EP 

<160> 16 

<170> PatentIn version 3.1 

<210> 1 

<211> 1160 

<212> DNA 

<213> Rhodosporidium toruloides 
<220> 

<221> CDS 

<222> (1) . . (1104) 

<223> coding for DAAO 

<400> 1 

atg cac teg cag aag cgc gtc gtt gtc etc gga tea ggc gtt ate ggt 48 

Met His Ser Gin Lys Arg Val Val Val Leu Gly Ser Gly Val lie Gly 
1 5 10 15 

ctg age age gee etc ate etc get egg aag ggc tac age gtg cat att 96 
Leu Ser Ser Ala Leu lie Leu Ala Arg Lys Gl-y Tyr Ser Val His lie 

20 25 30 

etc gcg cgc gac ttg ccg gag gac gtc teg age cag act ttc get tea 144 
Leu Ala Arg Asp Leu Pro Glu Asp Val Ser Ser Gin Thr Phe Ala Ser 
35 40 45 



cca tgg get ggc gcg aat tgg acg cct ttc atg acg ctt aca gac ggt 192 

Pro Trp Ala Gly Ala Asn Trp Thr Pro Phe Met Thr Leu Thr Asp Gly 
50 55 60 

cct ega caa gca aaa tgg gaa gaa teg act ttc aag aag tgg gtc gag 24 0 

Pro Arg Gin Ala Lys Trp Glu Glu Ser Thr Phe Lys Lys Trp Val Glu 
65 70 75 80 
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ttg gtc ccg acg ggc cat gcc atg tgg etc aag ggg acg agg egg ttc 2 88 

Leu Val Pro Thr Gly His Ala Met Trp Leu Lys Gly Thr Arg Arg Phe 

85 90 95 

5 

gcg cag aac gaa gac ggc ttg etc ggg cac tgg tac aag gac ate acg 336 

Ala Gin Asn Glu Asp Gly Leu Leu Gly His Trp Tyr Lys Asp lie Thr 

100 105 110 

10 cca aat tac cgc ccc etc cea tet tec gaa tgt cea cet ggc get ate 384 

Pro Asn Tyr Arg Pro Leu Pro Ser Ser Glu Cys Pro Pro Gly Ala lie 
115 120 125 

ggc gta ace tac gac ace etc tec gtc cac gca cca aag tac tgc cag 432 

15 Gly Val Thr Tyr Asp Thr Leu Ser Val His Ala Pro Lys Tyr Cys Gin 
130 135 140 

tac ett gca aga gag ctg cag aag etc ggc gcg acg ttt gag aga egg 4 80 

Tyr Leu Ala Arg Glu Leu Gin Lys Leu Gly Ala Thr Phe Glu Arg Arg 

20 145 150 155 160 

ace gtt acg teg ett gag cag gcg ttc gac ggt gcg gat ttg gtg gtc 52 8 

Thr Val Thr Ser Leu Glu Gin Ala Phe Asp Gly Ala Asp Leu Val Val 

165 170 175 

25 

aac get acg gga ett ggc gee aag teg att gcg ggc ate gac gac caa 576 

Asn Ala Thr Gly Leu Gly Ala Lys Ser lie Ala Gly lie Asp Asp Gin 

180 185 190 

30 gcc gcc gag cca ate cgc ggg caa ace gtc etc gtc aag tec cea tgc 624 

Ala Ala Glu Pro lie Arg Gly Gin Thr Val Leu Val Lys Ser Pro Cys 
195 200 205 

aag ega tgc acg atg gac teg tec gac ccc get tct ccc gee tac ate 672 

35 Lys Arg Cys Thr Met Asp Ser Ser Asp Pro Ala Ser Pro Ala Tyr lie 

210 215 220 

att ccc ega cca ggt ggc gaa gtc ate tgc ggc ggg acg tac ggc gtg 720 

lie Pro Arg Pro Gly Gly Glu Val lie Cys Gly Gly Thr Tyr Gly Val 

40 225 230 235 240 

gga gac tgg gac ttg tct gtc aac cca gag acg gtc cag egg ate etc 768 

Gly Asp Trp Asp Leu Ser Val Asn Pro Glu Thr Val Gin Arg lie Leu 

245 250 255 

45 
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aag cac tgc ttg cgc etc gac ccg acc ate teg age gae gga acg ate 816 
Lys His Cys Leu Arg Leu Asp Pro Thr lie Ser Ser Asp Gly Thr lie 

260 265 270 

5 gaa ggc ate gag gtc etc cgc cac aac gtc ggc ttg cga cet gea ega 864 
Glu Gly lie Glu Val Leu Arg His Asn Val Gly Leu Arg Pro Ala Arg 
275 280 285 

cga ggc gga ccc cgc gtt gag gea gaa egg ate gtc ctg cet etc gac 912 
10 Arg Gly Gly Pro Arg Val Glu Ala Glu Arg lie Val Leu Pro Leu Asp 
290 295 300 

egg aca aag teg ccc etc teg etc ggc agg ggc age gea ega geg geg 960 
Arg Thr Lys Ser Pro Leu Ser Leu Gly Arg Gly Ser Ala Arg Ala Ala 
15 305 310 315 320 

aag gag aag gag gte aeg ett gtg cat geg tat ggc ttc teg agt geg 1008 

Lys Glu Lys Glu Val Thr Leu Val His Ala Tyr Gly Phe Ser Ser Ala 

325 330 335 

20 

gga tae eag eag agt tgg ggc geg geg gag gat gte geg eag etc gte 1056 

Gly Tyr Gin Gin Ser Trp Gly Ala Ala Glu Asp Val Ala Gin Leu Val 

340 345 350 

25 gac gag geg ttc eag egg tae cac ggc geg geg egg gag teg aag ttg 1104 
Asp Glu Ala Phe Gin Arg Tyr His Gly Ala Ala Arg Glu Ser Lys Leu 
355 360 365 



30 



40 



tagggeggga tttgtggctg tattgeggge atctacaaga aaaaaaaaaa aaaaaa 1160 



<210> 2 

<211> 368 

<212> PRT 

35 <213> Rhodosporidium toruloides 

<400> 2 

Met His Ser Gin Lys Arg Val Val Val Leu Gly Ser Gly Val lie Gly 
15 10 15 



Leu Ser Ser Ala Leu lie Leu Ala Arg Lys Gly Tyr Ser Val His lie 

20 25 30 



Leu Ala Arg Asp Leu Pro Glu Asp Val Ser Ser Gin Thr Phe Ala Ser 
45 35 40 45 
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Pro Trp Ala Gly Ala Asn Trp Thr Pro Phe Met Thr Leu Thr Asp Gly 
50 55 60 

5 Pro Arg Gin Ala Lys Trp Glu Glu Ser Thr Phe Lys Lys Trp Val Glu 
65 70 75 80 

Leu Val Pro Thr Gly His Ala Met Trp Leu Lys Gly Thr Arg Arg Phe 

85 90 95 

10 

Ala Gin Asn Glu Asp Gly Leu Leu Gly His Trp Tyr Lys Asp He Thr 

100 105 110 

Pro Asn Tyr Arg Pro Leu Pro Ser Ser Glu Cys Pro Pro Gly Ala He 
15 115 120 125 

Gly Val Thr Tyr Asp Thr Leu Ser Val His Ala Pro Lys Tyr Cys Gin 
130 135 140 

20 Tyr Leu Ala Arg Glu Leu Gin Lys Leu Gly Ala Thr Phe Glu Arg Arg 
145 150 155 160 

Thr Val Thr Ser Leu Glu Gin Ala Phe Asp Gly Ala Asp Leu Val Val 

165 170 175 

25 

Asn Ala Thr Gly Leu Gly Ala Lys Ser He Ala Gly He Asp Asp Gin 

180 185 190 

Ala Ala Glu Pro He Arg Gly Gin Thr Val Leu Val Lys Ser Pro Cys 
30 195 200 205 

Lys Arg Cys Thr Met Asp Ser Ser Asp Pro Ala Ser Pro Ala Tyr He 
210 215 220 

35 He Pro Arg Pro Gly Gly Glu Val He Cys Gly Gly Thr Tyr Gly Val 
225 230 235 240 

Gly Asp Trp Asp Leu Ser Val Asn Pro Glu Thr Val Gin Arg He Leu 

245 250 255 

40 

Lys His Cys Leu Arg Leu Asp Pro Thr He Ser Ser Asp Gly Thr He 

260 265 270 

Glu Gly He Glu Val Leu Arg His Asn Val Gly Leu Arg Pro Ala Arg 
45 275 280 285 
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Arg Gly Gly Pro Arg Val Glu Ala Glu Arg lie Val Leu Pro Leu Asp 
290 295 300 

5 Arg Thr Lys Ser Pro Leu Ser Leu Gly Arg Gly Ser Ala Arg Ala Ala 

305 310 315 320 

Lys Glu Lys Glu Val Thr Leu Val His Ala Tyr Gly Phe Ser Ser Ala 

325 330 335 



10 



Gly Tyr Gin Gin Ser Trp Gly Ala Ala Glu Asp Val Ala Gin Leu Val 

340 345 350 



Asp Glu Ala Phe Gin Arg Tyr His Gly Ala Ala Arg Glu Ser Lys Leu 
15 355 360 365 

<210> 3 

<211> 1005 

<212> DNA 

20 <213> Caenorhabditis elegans 

<220> 

<221> CDS 

<222> (1) . . (1002) 

25 <223> coding for DAAO 

<400> 3 

atg gca aac ata att ccg aag att gca att ate ggc gaa gga gtc att 4 8 

Met Ala Asn lie lie Pro Lys lie Ala lie lie Gly Glu Gly Val lie 
30 1 5 10 15 

gga tgt act tea gca ctt caa ata tea aaa get ata cea aat gcg aaa 96 

Gly Cys Thr Ser Ala Leu Gin lie Ser Lys Ala lie Pro Asn Ala Lys 

20 25 30 

35 

ata aet gtg etc cac gat aaa cea ttt aaa aaa teg tge agt gca gga 144 

lie Thr Val Leu His Asp Lys Pro Phe Lys Lys Ser Cys Ser Ala Gly 
35 40 45 

40 eea gca gga tta ttt aga ate gat tat gag gag aat aet gaa tae gga 192 
Pro Ala Gly Leu Phe Arg lie Asp Tyr Glu Glu Asn Thr Glu Tyr Gly 
50 55 60 



45 



cgt get tet ttc gee tgg tte tea cat etc tat ege aet aca aaa gga 240 
Arg Ala Ser Phe Ala Trp Phe Ser His Leu Tyr Arg Thr Thr Lys Gly 
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65 70 75 80 

tec gaa acc ggc gtg aaa tta gtt tct gga cat att caa tec gac aac 288 
Ser Glu Thr Gly Val Lys Leu Val Ser Gly His lie Gin Ser Asp Asn 

85 90 95 

ttg gag tea ttg aag caa caa caa aga gcc tat ggc gat att gtg tac 336 
Leu Glu Ser Leu Lys Gin Gin Gin Arg Ala Tyr Gly Asp lie Val Tyr 

100 105 110 

aac ttt aga ttc ttg gat gat aga gaa egg ctg gac att ttt ccc gaa 384 
Asn Phe Arg Phe Leu Asp Asp Arg Glu Arg Leu Asp lie Phe Pro Glu 
115 120 125 

15 cca tea aag cac tgc att cac tac acc gcc tac gca tea gaa ggt aac 432 

Pro Ser Lys His Cys lie His Tyr Thr Ala Tyr Ala Ser Glu Gly Asn 
130 135 140 

aag tac gtg cct tat ttg aag aat ttg ctg ctt gag caa aaa ate gag 4 80 

20 Lys Tyr Val Pro Tyr Leu Lys Asn Leu Leu Leu Glu Gin Lys lie Glu 
145 150 155 160 

ttc aag caa caa gaa gtg acg agt ttg gac gca gte gee gac get ggt 528 
Phe Lys Gin Gin Glu Val Thr Ser Leu Asp Ala Val Ala Asp Ala Gly 
25 165 170 175 

tac gat gtt att gta aac tgc gca ggc ttg tac ggt gga aag ttg get 576 
Tyr Asp Val lie Val Asn Cys Ala Gly Leu Tyr Gly Gly Lys Leu Ala 

180 185 190 

30 

ggt gat gac gat act tgc tac ccc att aga gga gte att ttg gaa gtt 624 
Gly Asp Asp Asp Thr Cys Tyr Pro lie Arg Gly Val lie Leu Glu Val 
195 200 205 

35 gat gca cca tgg cac aag cac ttc aat tat ega gac ttt act act ttc 672 
Asp Ala Pro Trp His Lys His Phe Asn Tyr Arg Asp Phe Thr Thr Phe 
210 215 220 

aca att cca aaa gag cac age gtg gtg gtt ggg tec acc aag cag gac 720 
40 Thr lie Pro Lys Glu His Ser Val Val Val Gly Ser Thr Lys Gin Asp 
225 230 235 240 

aat ega tgg gat ttg gag ate acc gac gag gat aga aat gat att ttg 768 

Asn Arg Trp Asp Leu Glu lie Thr Asp Glu Asp Arg Asn Asp lie Leu 
45 245 250 255 



t* 
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aaa cga tac att get tta cat cct gga atg aga gag cca aag att ate 816 

Lys Arg Tyr lie Ala Leu His Pro Gly Met Arg Glu Pro Lys lie lie 

260 265 270 

aaa gaa tgg tea gea ctt cgc ecg gga cgt aag eat gtc aga att gaa 864 
Lys Glu Trp Ser Ala Leu Arg Pro Gly Arg Lys His Val Arg lie Glu 
275 280 285 

10 gcg cag aag agg aca tet gtt gga aac tea aaa gat tat atg gtt gtg 912 
Ala Gin Lys Arg Thr Ser Val Gly Asn Ser Lys Asp Tyr Met Val Val 

290 295 300 

cat cac tat ggt cac ggg age aac gga ttc acg ttg ggt tgg gga aca 960 
15 His His Tyr Gly His Gly Ser Asn Gly Phe Thr Leu Gly Trp Gly Thr 
305 310 315 320 

gea att gaa gea act aaa ctt gtt aag act gea eta gga tta taa 1005 
Ala. lie Glu Ala Thr Lys Leu Val Lys Thr Ala Leu Gly Leu 
20 325 330 

<210> 4 

<211> 334 

<212> PRT 

25 <213> Caenorhabditis elegans 

<400> 4 

Met Ala Asn He He Pro Lys He Ala He He Gly Glu Gly Val He 
1 5 10 15 • 



30 



Gly Cys Thr Ser Ala Leu Gin He Ser Lys Ala He Pro Asn Ala Lys 

20 25 30 



He Thr Val Leu His Asp Lys Pro Phe Lys Lys Ser Cys Ser Ala Gly 
35 35 40 45 

Pro Ala Gly Leu Phe Arg He Asp Tyr Glu Glu Asn Thr Glu Tyr Gly 

50 55 60 

40 Arg Ala Ser Phe Ala Trp Phe Ser His Leu Tyr Arg Thr Thr Lys Gly 
65 70 75 80 

Ser Glu Thr Gly Val Lys Leu Val Ser Gly His He Gin Ser Asp Asn 

85 90 95 

45 
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Leu Glu Ser Leu Lys Gin Gin Gin Arg Ala Tyr Gly Asp lie Val Tyr 

100 105 110 

Asn Phe Arg Phe Leu Asp Asp Arg Glu Arg Leu Asp lie Phe Pro Glu 
5 115 120 125 

Pro Ser Lys His Cys lie His Tyr Thr Ala Tyr Ala Ser Glu Gly Asn 
130 135 140 

10 Lys Tyr Val Pro Tyr Leu Lys Asn Leu Leu Leu Glu Gin Lys lie Glu 
145 150 155 160 

Phe Lys Gin Gin Glu Val Thr Ser Leu Asp Ala Val Ala Asp Ala Gly 

165 170 175 

15 

Tyr Asp Val lie Val Asn Cys Ala Gly Leu Tyr Gly Gly Lys Leu Ala 

180 185 190 

Gly Asp Asp Asp Thr Cys Tyr Pro lie Arg Gly Val lie Leu Glu Val 
20 195 200 205 

Asp Ala Pro Trp His Lys His Phe Asn Tyr Arg Asp Phe Thr Thr Phe 
210 215 220 

25 Thr lie Pro Lys Glu His Ser Val Val Val Gly Ser Thr Lys Gin Asp 
225 230 235 240 

Asn Arg Trp Asp Leu Glu lie Thr Asp Glu Asp Arg Asn Asp lie Leu 

245 250 255 

30 

Lys Arg Tyr lie Ala Leu His Pro Gly Met Arg Glu Pro Lys lie lie 

260 265 270 

Lys Glu Trp Ser Ala Leu Arg Pro Gly Arg Lys His Val Arg lie Glu 
35 275 280 285 

Ala Gin Lys Arg Thr Ser Val Gly Asn Ser Lys Asp Tyr Met Val Val 
290 295 300 

40 His His Tyr Gly His Gly Ser Asn Gly Phe Thr Leu Gly Trp Gly Thr 
305 310 315 320 



45 



Ala lie Glu Ala Thr Lys Leu Val Lys Thr Ala Leu Gly Leu 

325 330 
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<210> 5 

<211> 1186 

<212> DNA 

<213> Nectria haematococca 
<220> 

<221> CDS 

<222> (42) . . (1124) 

<223> coding for DAAO 

<400> 5 

agcgacttga atttagcgaa aagaacttgt caaccacaat c atg tec aac aca ate 56 

Met Ser Asn Thr lie 
1 5 

gtc gtc gtt ggt gcc ggt gtc att ggc ttg acg teg gee ttg ttg etc 104 
Val Val Val Gly Ala Gly Val lie Gly Leu Thr Ser Ala Leu Leu Leu 

10 15 20 

20 tec aag aac aag ggc aac aag ate ace gtc gtg gcc aag cac atg ccc 152 
Ser Lys Asn Lys Gly Asn Lys lie Thr Val Val Ala Lys His Met Pro 

25 30 35 



15 



ggc gae tat gae gtt gaa tac gcc teg cet ttt get ggt gee aac cac 200 

25 Gly Asp Tyr Asp Val Glu Tyr Ala Ser Pro Phe Ala Gly Ala Asn His 

40 45 50 

tec ccc atg gcg acg gaa gag age age gaa tgg gaa egt cgc act tgg 24 8 

Ser Pro Met Ala Thr Glu Glu Ser Ser Glu Trp Glu Arg Arg Thr Trp 
30 55 60 65 

tac gag ttt aag aga ctg gtc gag gag gtc cet gag gee ggt gtt eat 2 96 

Tyr Glu Phe Lys Arg Leu Val Glu Glu Val Pro Glu Ala Gly Val His 
70 75 80 85 

35 

ttc cag aag tct cgc ate cag agg cgc aat gtg gae act gaa aag gcg 344 

Phe Gin Lys Ser Arg lie Gin Arg Arg Asn Val Asp Thr Glu Lys Ala 

90 95 100 

40 cag agg tct ggt ttc cca gae gcc etc ttc teg aaa gaa ccc tgg ttc 392 

Gin Arg Ser Gly Phe Pro Asp Ala Leu Phe Ser Lys Glu Pro Trp Phe 

105 110 115 



45 



aag aac atg ttt gag gae ttc egt gag cag cac cet age gag gtc ate 440 
Lys Asn Met Phe Glu Asp Phe Arg Glu Gin His Pro Ser Glu Val lie 
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120 125 130 

ccc ggt tac gac tct ggc tgc gag ttc aca teg gtg tgc ate aae aeg 488 
Pro Gly Tyr Asp Ser Gly Cys Glu Phe Thr Ser Val Cys lie Asn Thr 
135 140 145 

gee ate tae etc ccc tgg etc etc ggc cag tgc ate aag aat ggc gtc 536 
Ala lie Tyr Leu Pro Trp Leu Leu Gly Gin Cys lie Lys Asn Gly Val 
150 155 160 165 

ate gtc aag cge gee ate etc aae gac att age gag gee aag aag etg 584 
lie Val Lys Arg Ala lie Leu Asn Asp lie Ser Glu Ala Lys Lys Leu 

170 175 180 

15 age cac gcg gge aag aeg ccc aat ate ate gtc aae gee aeg ggt etc 632 
Ser His Ala Gly Lys Thr Pro Asn lie lie Val Asn Ala Thr Gly Leu 

185 190 195 

ggc tee tae aag etg gge ggt gtc gag gac aag ace atg gcg cet gcg 680 
20 Gly Ser Tyr Lys Leu Gly Gly Val Glu Asp Lys Thr Met Ala Pro Ala 

200 205 210 

egg 95r^ ca9 sitt gtg gtt gtg cge aae gag age age ccc atg etc etc 72 8 

Arg Gly Gin lie Val Val Val Arg Asn Glu Ser Ser Pro Met Leu Leu 
25 215 220 225 

act tea ggt gtc gag gac ggc ggt get gat gtc atg tac ttg atg cag 776 
Thr Ser Gly Val Glu Asp Gly Gly Ala Asp Val Met Tyr Leu Met Gin 
230 235 240 245 

30 

cga gea get ggc ggt gge ace ate etg ggc ggt ace tac gac gtt gge 824 
Arg Ala Ala Gly Gly Gly Thr lie Leu Gly Gly Thr Tyr Asp Val Gly 

250 255 260 

35 aae tgg gag tct cag eca gac ccc aae ate gcg aat cge ate atg cag 872 
Asn Trp Glu Ser Gin Pro Asp Pro Asn lie Ala Asn Arg lie Met Gin 

265 270 275 

cge ate gtc gag gtg egg ccc gag att gee aac gge aag gge gtc aag 92 0 

40 Arg lie Val Glu Val Arg Pro Glu lie Ala Asn Gly Lys Gly Val Lys 

280 285 290 

999 etg age gtg ate cga cac gee gtc gge atg egg eeg tgg cga aag 968 
Gly Leu Ser Val lie Arg His Ala Val Gly Met Arg Pro Trp Arg Lys 
45 295 300 305 
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gac gga gtc agg ate gag gag gag aag ctg gat gat gag act tgg ate 1016 
Asp Gly Val Arg lie Glu Glu Glu Lys Leu Asp Asp Glu Thr Trp He 
310 315 320 325 

5 

gtg cac aac tac gga cac tct gga tgg ggt tac cag ggt teg tat ggt 1064 
Val His Asn Tyr Gly His Ser Gly Trp Gly Tyr Gin Gly Ser Tyr Gly 

330 335 340 

10 tgt get gag aat gta gte cag ttg gtt gac aag gte gge aag gcg gee 1112 
Cys Ala Glu Asn Val Val Gin Leu Val Asp Lys Val Gly Lys Ala Ala 

345 350 355 

aag tct aag etg tagttgaaaa ggeetgaatg agtaatagta attggatatt 1164 
15 Lys Ser Lys Leu 

360 

ggaaataccg tatttgecct eg 1186 

* 

20 <210> 6 

.<211> 361 
<212> PRT 

<213> Nectria haematoeocea 
25 <400> 6 

Met Ser Asn Thr He Val Val Val Gly Ala Gly Val He Gly Leu Thr 
15 10 15 

Ser Ala Leu Leu Leu Ser Lys Asn Lys Gly Asn Lys He Thr Val Val 
30 20 25 30 

Ala Lys His Met Pro Gly Asp Tyr Asp Val Glu Tyr Ala Ser Pro Phe 
35 40 45 

35 Ala Gly Ala Asn His Ser Pro Met Ala Thr Glu Glu Ser Ser Glu Trp 

50 55 60 

Glu Arg Arg Thr Trp Tyr Glu Phe Lys Arg Leu Val Glu Glu Val Pro 
65 70 75 80 



40 



Glu Ala Gly Val His Phe Gin Lys Ser Arg He Gin Arg Arg Asn Val 

85 90 95 



Asp Thr Glu Lys Ala Gin Arg Ser Gly Phe Pro Asp Ala Leu Phe Ser 

45 100 105 110 
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Lys Glu Pro 
115 

5 Pro Ser Glu 
130 

Val Cys lie 
145 

10 

lie Lys Asn 
Glu Ala Lys 

15 

Asn Ala Thr 
195 

20 Thr Met Ala 

210 

Ser Pro Met 

225 

25 

Met Tyr Leu 
Thr Tyr Asp 

30 

Asn Arg lie 
275 

35 

Gly Lys Gly 
290 

Arg Pro Trp 
40 305 

Asp Glu Thr 



Trp Phe Lys Asn 



Val lie Pro Gly 

135 

Asn Thr Ala lie 
150 

Gly Val lie Val 
165 

Lys Leu Ser His 
180 

Gly Leu Gly Ser 



Pro Ala Arg Gly 

215 

Leu Leu Thr Ser 
230 

Met Gin Arg Ala 
245 

Val Gly Asn Trp 
260 

Met Gin Arg lie 



Val Lys Gly Leu 

295 

Arg Lys Asp Gly 
310 

Trp lie Val His 
325 



12 

Met Phe Glu Asp Phe 
120 

Tyr Asp Ser Gly Cys 

14 0 

Tyr Leu Pro Trp Leu 

155 

Lys Arg Ala lie Leu 
170 

Ala Gly Lys Thr Pro 
185 

Tyr Lys Leu Gly Gly 

200 

Gin lie Val Val Val 

220 

Gly Val Glu Asp Gly 

235 

Ala Gly Gly Gly Thr 
250 

Glu Ser Gin Pro Asp 
265 

Val Glu Val Arg Pro 

280 



Ser Val lie Arg His 

300 

Val Arg lie Glu Glu 

315 

Asn Tyr Gly His Ser 
330 



Arg Glu Gin His 
125 

Glu Phe Thr Ser 



Leu Gly Gin Cys 

160 

Asn Asp lie Ser 
175 

Asn lie lie Val 
190 

Val Glu Asp Lys 

205 

Arg Asn Glu Ser 



Gly Ala Asp Val 

240 

lie Leu Gly Gly 
255 

Pro Asn lie Ala 
270 

Glu lie Ala Asn 

285 



Ala Val Gly Met 



Glu Lys Leu Asp 

320 

Gly Trp Gly Tyr 
335 



45 



48 
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Gin Gly Ser Tyr Gly Cys Ala Glu Asn Val Val Gin Leu Val Asp Lys 

340 345 350 

Val Gly Lys Ala Ala Lys Ser Lys Leu 
5 355 360 

<210> 7 

<211> 1071 

<212> DNA 

10 <213> Trigonopsis variabilis 

<220> 

<221> CDS 
<222> (1) . . (1068) 
15 <223> 

<400> 7 

atg get aaa ate gtt gtt att ggt gcc ggt gtt gcc ggt tta act aca 
Met Ala Lys lie Val Val lie Gly Ala Gly Val Ala Gly Leu Thr Thr 
20 1 5 10 15 

get ctt caa ett ett egt aaa gga cat gag gtt aca att gtg tee gag 96 
Ala Leu Gin Leu Leu Arg Lys Gly His Glu Val Thr lie Val Ser Glu 

20 25 30 

25 

ttt acg ccc ggt gat ctt agt ate gga tat ace teg ect tgg gea ggt 144 
Phe Thr Pro Gly Asp Leu Ser lie Gly Tyr Thr Ser Pro Trp Ala Gly 
35 40 45 

30 gee aac tgg etc aca ttt tac gat gga ggc aag tta gcc gae tac gat 192 

Ala Asn Trp Leu Thr Phe Tyr Asp Gly Gly Lys Leu Ala Asp Tyr Asp 
50 55 60 

gee gte tet tat cet ate ttg ega gag ctg get ega age age cec gag 24 0 

35 Ala Val Ser Tyr Pro lie Leu Arg Glu Leu Ala Arg Ser Ser Pro Glu 
65 70 75 80 

get gga att ega etc ate age caa egc tee eat gtt etc aag cgt gat 288 
Ala Gly lie Arg Leu lie Ser Gin Arg Ser His Val Leu Lys Arg Asp 
40 85 90 95 

ctt act aaa ctg gaa gtt gcc atg teg gcc ate tgt caa cgc aat ccc 336 

Leu Pro Lys Leu Glu Val Ala Met Ser Ala lie Cys Gin Arg Asn Pro 

100 105 110 

45 
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tgg ttc aaa aac aca gtc gat tct ttc gag att ate gag gac agg tec 

Trp Phe Lys Asn Thr Val Asp Ser Phe Glu lie lie Glu Asp Arg Ser 
115 120 125 

agg att gtc cac gat gat gtg get tat eta gtc gaa ttt cgt tec gtt 

Arg lie Val His Asp Asp Val Ala Tyr Leu Val Glu Phe Arg Ser Val 
130 135 140 

tgt ate cac ace gga gtc tac ttg aac tgg ctg atg tee caa tge tta 

Cys lie His Thr Gly Val Tyr Leu Asn Trp Leu Met Ser Gin Cys Leu 

145 150 155 160 

teg etc gge gee aeg gtg gtt aaa cgt ega gtg aac eat ate aag gat 

Ser Leu Gly Ala Thr Val Val Lys Arg Arg Val Asn His lie Lys Asp 

165 170 175 

gee aat tta eta cac tec tea gga tea cgc ccc gae gtg att gtc aac 

Ala Asn Leu Leu His Ser Ser Gly Ser Arg Pro Asp Val lie Val Asn 

180 185 190 

tgt agt ggt etc ttt gee egg tte ttg gga gge gtc gag gae aag aag 

Cys Ser Gly Leu Phe Ala Arg Phe Leu Gly Gly Val Glu Asp Lys Lys 
195 200 205 

atg tac eet att ega gga caa gtc gtc ett gtt ega aac tct ett ect 

Met Tyr Pro lie Arg Gly Gin Val Val Leu Val Arg Asn Ser Leu Pro 
210 215 220 

ttt atg gcc tec ttt tec age act ect gaa aaa gaa aat gaa gae gaa 
Phe Met Ala Ser Phe Ser Ser Thr Pro Glu Lys Glu Asn Glu Asp Glu 

225 230 235 240 



get eta tat ate atg ace ega tte gat ggt act tct ate att gge ggt 
Ala Leu Tyr lie Met Thr Arg Phe Asp Gly Thr Ser lie lie Gly Gly 

245 250 255 

tgt tte caa ccc aac aac tgg tea tee gaa ccc gat eet tct etc ace 
Cys Phe Gin Pro Asn Asn Trp Ser Ser Glu Pro Asp Pro Ser Leu Thr 

260 265 270 



eat ega ate ctg tct aga gee etc gae ega tte eeg gaa ctg ace aaa 
His Arg lie Leu Ser Arg Ala Leu Asp Arg Phe Pro Glu Leu Thr Lys 
275 280 285 
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gat ggc cct ctt gac att gtg cgc gaa tgc gtt ggc cac cgt cct ggt 912 
Asp Gly Pro Leu Asp lie Val Arg Glu Cys Val Gly His Arg Pro Gly 
290 295 300 

5 aga gag ggc ggt ccc cga gta gaa tta gag aag ate ccc ggc gtt ggc 960 
Arg Glu Gly Gly Pro Arg Val Glu Leu Glu Lys lie Pro Gly Val Gly 
305 310 315 320 

ttt gtt gtc cat aac tat ggt gcc gcc ggt get ggt tac caa tec tct 1008 
10 Phe Val Val His Asn Tyr Gly Ala Ala Gly Ala Gly Tyr Gin Ser Ser 

325 330 335 

tac ggc atg get gat gaa get gtt tct tac gtc gaa aga get ctt act 1056 
Tyr Gly Met Ala Asp Glu Ala Val Ser Tyr Val Glu Arg Ala Leu Thr 
15 340 345 350 

cgt cea aac ctt tag 1071 
Arg Pro Asn Leu 
355 

20 

<210> 8 
<211> 356 
<212> PRT 
25 <213> Trigonopsis variabilis 

<400> 8 

Met Ala Lys lie Val Val lie Gly Ala Gly Val Ala Gly Leu Thr Thr 
15 10 15 



30 



Ala Leu Gin Leu Leu Arg Lys Gly His Glu Val Thr He Val Ser Glu 

20 25 30 



Phe Thr Pro Gly Asp Leu Ser He Gly Tyr Thr Ser Pro Trp Ala Gly 
35 35 40 45 

Ala Asn Trp Leu Thr Phe Tyr Asp Gly Gly Lys Leu Ala Asp Tyr Asp 
50 55 60 

40 Ala Val Ser Tyr Pro He Leu Arg Glu Leu Ala Arg Ser Ser Pro Glu 
65 70 75 80 

Ala Gly He Arg Leu He Ser Gin Arg Ser His Val Leu Lys Arg Asp 

85 90 95 

45 
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Leu Pro Lys Leu 

100 

Trp Phe Lys Asn 
115 

Arg lie Val His 
130 

Cys lie His Thr 
145 

Ser Leu Gly Ala 



Ala Asn Leu Leu 

180 

Cys Ser Gly Leu 
195 

Met Tyr Pro lie 
210 

Phe Met Ala Ser 
225 

Ala Leu Tyr lie 



Cys Phe Gin Pro 

260 

His Arg lie Leu 

275 

Asp Gly Pro Leu 

290 

Arg Glu Gly Gly 
305 



Glu Val Ala Met 



Thr Val Asp Ser 

120 

Asp Asp Val Ala 
135 

Gly Val Tyr Leu 
150 

Thr Val Val Lys 
165 

His Ser Ser Gly 



Phe Ala Arg Phe 

200 

Arg Gly Gin Val 

215 

Phe Ser Ser Thr 
230 

Met Thr Arg Phe 
245 

Asn Asn Trp Ser 



Ser Arg Ala Leu 

280 

Asp lie Val Arg 

295 

Pro Arg Val Glu 
310 



16 

Ser Ala lie Cys 
105 

Phe Glu lie lie 



Tyr Leu Val Glu 

14 0 

Asn Trp Leu Met 
155 

Arg Arg Val Asn 

170 

Ser Arg Pro Asp 
185 

Leu Gly Gly Val 



Val Leu Val Arg 

220 

Pro Glu Lys Glu 
235 

Asp Gly Thr Ser 
250 

Ser Glu Pro Asp 
265 

Asp Arg Phe Pro 



Glu Cys Val Gly 

300 

Leu Glu Lys lie 
315 



Gin Arg Asn Pro 
110 

Glu Asp Arg Ser 

125 

Phe Arg Ser Val 



Ser Gin Cys Leu 

160 

His lie Lys Asp 

175 

Val lie Val Asn 
190 

Glu Asp Lys Lys 
205 

Asn Ser Leu Pro 



Asn Glu Asp Glu 

240 

lie lie Gly Gly 
255 

Pro Ser Leu Thr 
270 

Glu Leu Thr Lys 
285 

His Arg Pro Gly 



Pro Gly Val Gly 

320 



Phe Val Val His Asn Tyr Gly Ala Ala Gly Ala Gly Tyr Gin Ser Ser 

325 330 335 
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Tyr Gly Met Ala Asp Glu Ala Val Ser Tyr Val Glu Arg Ala Leu Thr 

340 345 350. 

Arg Pro Asn Leu 
5 355 



<210> 9 

<211> 1047 

10 <212> DNA 

<213> Schizosaccharomyces pombe 

<220> 

<221> CDS 

15 <222> (22) . . (1041) 

<22 3> coding for DAAO 

<400> 9 

atgactaagg aaaataagcc a aga gat att gtc ate gtt ggc get ggc gtt 51 

20 Arg Asp lie Val lie Val Gly Ala Gly Val 

15 10 

^tt gga ttg acc act get tgg att ett tea gae ttg ggt ctt get ect 99 
lie Gly Leu Thr Thr Ala Trp lie Leu Ser Asp Leu Gly Leu Ala Pro 
25 15 20 25 

cgt att aag gtg att gcc aag tat acg ect gaa gat cgt tct gta gaa 147 

Arg lie Lys Val lie Ala Lys Tyr Thr Pro Glu Asp Arg Ser Val Glu 

30 • 35 40 

30 

tac act tec ect tgg get ggc gca aat ttc tgt age att tct get act 195 

Tyr Thr Ser Pro Trp Ala Gly Ala Asn Phe Cys Ser lie Ser Ala Thr 
45 50 55 

35 gat gae aat get ttg cgc tgg gat aaa ate act tac eat cgt ttc gee 243 
Asp Asp Asn Ala Leu Arg Trp Asp Lys lie Thr Tyr His Arg Phe Ala 
60 65 70 

tac ttg geg aaa act cgt ect gaa gca gga ate cgt ttt get gat ett 2 91 

40 Tyr Leu Ala Lys Thr Arg Pro Glu Ala Gly lie Arg Phe Ala Asp Leu 
75 80 85 90 

ega gaa ttg tgg gag tae gag ccg aaa eae gae aaa ate aga tee tgg 33 9 

Arg Glu Leu Trp Glu Tyr Glu Pro Lys His Asp Lys lie Arg Ser Trp 
45 95 100 105 



40 



387 
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aat acc tat gtc aga gat ttc aaa gtt ate cct gaa aaa gat ctt cca 
Asn Thr Tyr Val Arg Asp Phe Lys Val lie Pro Glu Lys Asp Leu Pro 

110 115 120 

5 gga gaa tgt ate tac gga cat aag gcc acc acc ttt tta ate aae get 435 
Gly Glu Cys He Tyr Gly His Lys Ala Thr Thr Phe Leu He Asn Ala 
125 130 135 

eet cat tac ttg aat tat atg tac aag ctg etc att gaa get ggc gtc 483 
10 Pro His Tyr Leu Asn Tyr Met Tyr Lys Leu Leu He Glu Ala Gly Val 

140 145 150 

gaa ttt gaa aag aaa gaa ttg agt cac ate aaa gag act gtc gaa gaa 531 
Glu Phe Glu Lys Lys Glu Leu Ser His He Lys Glu Thr Val Glu Glu 
15 155 160 165 170 

act cca gaa get tea gta gta ttt aat tgc act ggt etc tgg get tec 579 

Thr Pro Glu Ala Ser Val Val Phe Asn Cys Thr Gly Leu Trp Ala Ser 

175 180 185 

20 

aaa ttg ggt ggc gtt gaa gac ecg gae gtt tat eeg act egt gga cat 627 

Lys Leu Gly Gly Val Glu Asp Pro Asp Val Tyr Pro Thr Arg Gly His 

190 195 200 

25 gtt gtt ttg gtt aag get cct cat gta aca gaa act cgc att ttg aat 675 
Val Val Leu Val Lys Ala Pro His Val Thr Glu Thr Arg He Leu Asn 
205 210 215 

ggc aag aac tct gat acc tat att att cct egt ccc tta aat ggt gga 723 
30 Gly Lys Asn Ser Asp Thr Tyr He He Pro Arg Pro Leu Asn Gly Gly 

220 225 230 

gtc att tgc ggc ggt ttc atg caa cca gga aac tgg gat egt gaa att 771 
Val He Cys Gly Gly Phe Met Gin Pro Gly Asn Trp Asp Arg Glu He 
35 235 240 245 250 

cac cct gaa gae act ttg gat ate ctt aag aga aca teg get ttg atg 819 
His Pro Glu Asp Thr Leu Asp He Leu Lys Arg Thr Ser Ala Leu Met 

255 260 265 



cca gaa ttg ttc cac ggc aag ggt eeg gag ggt get gaa att att caa 867 

Pro Glu Leu Phe His Gly Lys Gly Pro Glu Gly Ala Glu He He Gin 

270 275 280 



45 
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gaa tgt gtc gga ttc cgt cct tct cga aag ggt ggt gcc cgc gta gag 915 
Glu Cys Val Gly Phe Arg Pro Ser Arg Lys Gly Gly Ala Arg Val Glu 
285 290 295 

5 ctt gat gtt gtt ccc ggc acc tea gtc ccc ctt gtt cat gat tac ggt 963 
Leu Asp Val Val Pro Gly Thr Ser Val Pro Leu Val His Asp Tyr Gly 
300 305 310 

get tct ggc aca gga tac caa get ggt tat ggt atg get ctt gac tct 1011 
10 Ala Ser Gly Thr Gly Tyr Gin Ala Gly Tyr Gly Met Ala Leu Asp Ser 
315 320 325 330 

gtc atg ttg get ctt cct aaa ate aaa ttg gcttag 1047 
Val Met Leu Ala Leu Pro Lys lie Lys Leu 
15 335 340 



<210> 10 

<211> 340 

20 <212> PRT 

<213> Schizosaecharomyces pombe 

<400> 10 

Arg Asp lie Val lie Val Gly Ala Gly Val He Gly Leu Thr Thr Ala 
25 1 5 10 15 

Trp He Leu Ser Asp Leu Gly Leu Ala Pro Arg He Lys Val He Ala 

20 25 30 

30 Lys Tyr Thr Pro Glu Asp Arg Ser Val Glu Tyr Thr Ser Pro Trp Ala 

35 40 45 

Gly Ala Asn Phe Cys Ser He Ser Ala Thr Asp Asp Asn Ala Leu Arg 
50 55 60 

35 

Trp Asp Lys He Thr Tyr His Arg Phe Ala Tyr Leu Ala Lys Thr Arg 
65 70 75 80 

Pro Glu Ala Gly He Arg Phe Ala Asp Leu Arg Glu Leu Trp Glu Tyr 
40 85 90 95 

Glu Pro Lys His Asp Lys He Arg Ser Trp Asn Thr Tyr Val Arg Asp 

100 105 110 



45 
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Phe Lys Val lie Pro Glu Lys Asp Leu Pro Gly Glu Cys lie Tyr Gly 
115 120 125 

His Lys Ala Thr Thr Phe Leu lie Asn Ala Pro His Tyr Leu Asn Tyr 
5 130 135 140 

Met Tyr Lys Leu Leu lie Glu Ala Gly Val Glu Phe Glu Lys Lys Glu 
145 150 155 160 

10 Leu Ser His lie Lys Glu Thr Val Glu Glu Thr Pro Glu Ala Ser Val 

165 170 175 

Val Phe Asn Cys Thr Gly Leu Trp Ala Ser Lys Leu Gly Gly Val Glu 

180 185 190 

15 

Asp Pro Asp Val Tyr Pro Thr Arg Gly His Val Val Leu Val Lys Ala 
195 200 205 

Pro His Val Thr Glu Thr Arg lie Leu Asn Gly Lys Asn Ser Asp Thr 
20 210 215 220 

Tyr lie lie Pro Arg Pro Leu Asn Gly Gly Val lie Cys Gly Gly Phe 
225 230 235 240 

25 Met Gin Pro Gly Asn Trp Asp Arg Glu lie His Pro Glu Asp Thr Leu 

245 250 255 

Asp lie Leu Lys Arg Thr Ser Ala Leu Met Pro Glu Leu Phe His Gly 

260 265 270 

30 

Lys Gly Pro Glu Gly Ala Glu lie lie Gin Glu Cys Val Gly Phe Arg 
275 280 285 



35 Pro Ser Arg Lys 
290 

Thr Ser Val Pro 
305 

40 

Gin Ala Gly Tyr 



Gly Gly Ala Arg 
295 

Leu Val His Asp 
310 

Gly Met Ala Leu 

325 



Val Glu Leu Asp 

300 

Tyr Gly Ala Ser 
315 

Asp Ser Val Met 
330 



Val Val Pro Gly 



Gly Thr Gly Tyr 

320 

Leu Ala Leu Pro 

335 



Lys lie Lys Leu 
45 340 



10 



15 



20 
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<210> 11 
<211> .963 
<212> DNA 

<213> Streptomyces coelicolor 

5 

<220> 

<221> CDS 

<222> (31) . . (957) 

<22 3> coding for DAAO 

<220> 

<221> misc_f eature 

<222> (880) . . (936) 

<22 3> DAAO signature 

<400> 11 

gtggaaaccg aactggatga cgagcgggat ggc gaa gtc gtc gtg gtc ggc ggc 54 

Gly Glu Val Val Val Val Gly Gly 
1 5 

ggg gtg ate ggg ctg acg acg gcc gtc gtc etc gcc gag egg ggc aga 102 
Gly Val lie Gly Leu Thr Thr Ala Val Val Leu Ala Glu Arg Gly Arg 
10 15 20 

25 egg gtg egg ctg tgg ace egg gag eee gcg gag egg ace ace teg gtg 15 0 

Arg Val Arg Leu Trp Thr Arg Glu Pro Ala Glu Arg Thr Thr Ser Val 
25 30 35 40 

gta gcg ggc ggg ctg tgg tgg ecg tac egg ate gag ccg gtc gcg ctg 198 
30 Val Ala Gly Gly Leu Trp Trp Pro Tyr Arg lie Glu Pro Val Ala Leu 

45 50 55 

gee eag gee tgg gcg ctg egt tec ctg gae gtg tac gag gag ctg gcg 24 6 

Ala Gin Ala Trp Ala Leu Arg Ser Leu Asp Val Tyr Glu Glu Leu Ala 
35 60 65 70 

gca egg cec ggg eag ace ggc gta ege atg etc gaa ggg gtg etc ggc 294 

Ala Arg Pro Gly Gin Thr Gly Val Arg Met Leu Glu Gly Val Leu Gly 
75 80 85 

40 

gag ace ggc ctg gae gag gtg gae ggg tgg gee gcg gcc egg ctg ecg 342 
Glu Thr Gly Leu Asp Glu Val Asp Gly Trp Ala Ala Ala Arg Leu Pro 
90 95 100 

45 ggg ctg ege gcg gcg age gee gee gag tac gee ggg acg ggg ctg tgg 3 90 
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Gly Leu Arg Ala Ala Ser Ala Ala Glu Tyr Ala Gly Thr Gly Leu Trp 
105 110 115 120 

gcg egg ctg ccg etc ate gac atg teg acc cat etg ecg tgg ctg egg 43 8 

5 Ala Arg Leu Pro Leu lie Asp Met Ser Thr His Leu Pro Trp Leu Arg 

125 130 135 

gag egg etg etg gee geg ggc ggc aeg gtg gag gae ege gcg gtg ace 486 
Glu Arg Leu Leu Ala Ala Gly Gly Thr Val Glu Asp Arg Ala Val Thr 
10 140 145 150 

gat etg gee gag gcg gae gcg ccg gtg gtg gte aac tgc acc ggc ctg 534 

Asp Leu Ala Glu Ala Asp Ala Pro Val Val Val Asn Cys Thr Gly Leu 

155 160 165 

15 

ggc gcc egg gag ctg gtg ccg gac ccg gcg gta egg ccg gtg cgc gga 5 82 

Gly Ala Arg Glu Leu Val Pro Asp Pro Ala Val Arg Pro Val Arg Gly 

170 175 180 

20 cag etg gte gte gtg gag aac cec ggc ate eac aac tgg ctg gte geg 63 0 

Gin Leu Val Val Val Glu Asn Pro Gly lie His Asn Trp Leu Val Ala 
185 190 195 200 

gee gac geg gae tec ggg gag aeg aeg tac tte ctt ccg cag ccg gga 67 8 

25 Ala Asp Ala Asp Ser Gly Glu Thr Thr Tyr Phe Leu Pro Gin Pro Gly 

205 210 215 

egg etc etg etg ggc ggc aeg get gag gag gae gee tgg teg ace gag 726 
Arg Leu Leu Leu Gly Gly Thr Ala Glu Glu Asp Ala Trp Ser Thr Glu 
30 220 225 230 

ecg gae ecg gag gte geg geg gee ate gtg cga egg tgc geg gee ctg 774 

Pro Asp Pro Glu Val Ala Ala Ala lie Val Arg Arg Cys Ala Ala Leu 

235 240 245 

35 

egt cec gag ate gcc gga gcg egg gtg etc gcg cae ctg gtg ggg ctg 822 

Arg Pro Glu lie Ala Gly Ala Arg Val Leu Ala His Leu Val Gly Leu 

250 255 260 

40 egg ccg gcc egg gae gcg gte egg ctg gag cgc ggg aeg ctg ccg gac 870 
Arg Pro Ala Arg Asp Ala Val Arg Leu Glu Arg Gly Thr Leu Pro Asp 
265 270 275 280 

ggg cgc egg ctg gtg cae aac tac ggt cae ggc ggc gcg ggc gte acc 918 
45 Gly Arg Arg Leu Val His Asn Tyr Gly His Gly Gly Ala Gly Val Thr 
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285 290 295 

gtg gcc tgg ggc tgc get cag gag gcg gcc egg etc gcc tcctga 963 
Val Ala Trp Gly Cys Ala Gin Glu Ala Ala Arg Leu Ala 

300 305 



<210> 12 

<211> 309 

10 <212> PRT 

<213> Streptomyces coelicolor 

<220> 

<221> misc_feature 

15 <222> (880) . . (936) 

<22 3> DAAO signature 

<400> 12 

Gly Glu Val Val Val Val Gly Gly Gly Val lie Gly Leu Thr Thr Ala 
20 1 5 10 15 

Val Val Leu Ala Glu Arg Gly Arg .Arg Val Arg Leu Trp Thr Arg Glu 

20 25 30 

25 Pro Ala Glu Arg Thr Thr Ser Val Val Ala Gly Gly Leu Trp Trp Pro 

35 40 45 

Tyr Arg lie Glu Pro Val Ala Leu Ala Gin Ala Trp Ala Leu Arg Ser 
50 55 60 



30 



Leu Asp Val Tyr Glu Glu Leu Ala Ala Arg Pro Gly Gin Thr Gly Val 
65 70 75 80 



Arg Met Leu Glu Gly Val Leu Gly Glu Thr Gly Leu Asp Glu Val Asp 
35 85 90 95 

Gly Trp Ala Ala Ala Arg Leu Pro Gly Leu Arg Ala Ala Ser Ala Ala 

100 105 110 

40 Glu Tyr Ala Gly Thr Gly Leu Trp Ala Arg Leu Pro Leu lie Asp Met 

115 120 125 



45 



Ser Thr His Leu Pro Trp Leu Arg Glu Arg Leu Leii Ala Ala Gly Gly 
130 135 140 



PF 55443 

24 

Thr Val Glu Asp Arg Ala Val Thr Asp Leu Ala Glu Ala Asp Ala Pro 
145 150 155 160 

Val Val Val Asn Cys Thr Gly Leu Gly Ala Arg Glu Leu Val Pro Asp 
5 165 170 175 

Pro Ala Val Arg Pro Val Arg Gly Gin Leu Val Val Val Glu Asn Pro 

180 185 190 

10 Gly lie His Asn Trp Leu Val Ala Ala Asp Ala Asp Ser Gly Glu Thr 

195 200 205 

Thr Tyr Phe Leu Pro Gin Pro Gly Arg Leu Leu Leu Gly Gly Thr Ala 

210 215 220 



15 



Glu Glu Asp Ala Trp Ser Thr Glu Pro Asp Pro Glu Val Ala Ala Ala 
225 230 235 240 



lie Val Arg Arg Cys Ala Ala Leu Arg Pro Glu lie Ala Gly Ala Arg 
20 245 250 255 

Val Leu Ala His Leu Val Gly Leu Arg Pro Ala Arg Asp Ala Val Arg 

260 265 270 

25 Leu Glu Arg Gly Thr Leu Pro Asp Gly Arg Arg Leu Val His Asn Tyr 

275 280 285 

Gly His Gly Gly Ala Gly Val Thr Val Ala Trp Gly Cys Ala Gin Glu 
290 295 300 

30 

Ala Ala Arg Leu Ala 
305 

<210> 13 
35 <211> 1038 
<212> DNA 

<213> Candida boidinii 

<220> 

40 <221> CDS 

<222> (1) . . (1035) 

<223> coding for DAAO 



45 
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25 

<400> 13 

atg ggt gat caa att gtt gtt ctt ggt tec ggt att att ggt tta tat 4 8 

Met Gly Asp Gin lie Val Val Leu Gly Ser Gly lie lie Gly Leu Tyr 
15 10 15 

5 

act aca tac tgt tta ate tat gag get gga tgt get eca get aaa att 96 
Thr Thr Tyr Cys Leu lie Tyr Glu Ala Gly Cys Ala Pro Ala Lys lie 

20 25 30 

10 act att gtt get gaa ttt tta cca ggt gat caa tet aca tta tat aca 144 
Thr lie Val Ala Glu Phe Leu Pro Gly Asp Gin Ser Thr Leu Tyr Thr 
35 40 45 

tct cca tgg gca ggt ggt aat ttt tet tgt att tea cca get gat gat 192 
15 Ser Pro Trp Ala Gly Gly Asn Phe Ser Cys lie Ser Pro Ala Asp Asp 
50 55 60 

aca aca ttg get tat gat aaa ttc aca tat ctt aat tta tte aag att 240 
Thr Thr Leu Ala Tyr Asp Lys Phe Thr Tyr Leu Asn Leu Phe Lys lie 
20 65 70 75 80 

cac aaa aaa tta ggt gga cca gaa tgt gga tta gat aat aag cca agt 288 
His Lys Lys Leu Gly Gly Pro Glu Cys Gly Leu Asp Asn Lys Pro Ser 

85 90 95 

25 

act gaa tat tgg gat ttt tat ect ggt gat gaa aaa gtc aat tct tta 33 6 

Thr Glu Tyr Trp Asp Phe Tyr Pro Gly Asp Glu Lys Val Asn Ser Leu 

100 105 110 

30 aaa caa tat ctt aaa gat ttt aaa gtt att cca aaa tea gaa tta cca 3 84 

Lys Gin Tyr Leu Lys Asp Phe Lys Val lie Pro Lys Ser Glu Leu Pro 
115 120 125 

gaa ggt gtt gaa tat ggt att agt tat act aca tgg aat ttc aac tgt 432 
35 Glu Gly Val Glu Tyr Gly lie Ser Tyr Thr Thr Trp Asn Phe Asn Cys 

130 135 140 

ect gtt ttc tta caa aat atg get aat ttt tta aat aaa aga aat gtt 4 80 

Pro Val Phe Leu Gin Asn Met Ala Asn Phe Leu Asn Lys Arg Asn Val 
40 145 150 155 160 

acc att att aga aaa cat tta aca cat att tct caa get tat tta aca 528 
Thr lie lie Arg Lys His Leu Thr His lie Ser Gin Ala Tyr Leu Thr 

165 170 175 

45 
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gtt aat aca aaa gtt gtt ttc aac tgt aca ggt att ggt get get gat 576 
Val Asn Thr Lys Val Val Phe Asn Cys Thr Gly lie Gly Ala Ala Asp 

180 185 190 

5 tta ggt ggt gtt aaa gat gaa aaa gtt tat cca act aga gga caa gtt 624 
Leu Gly Gly Val Lys Asp Glu Lys Val Tyr Pro Thr Arg Gly Gin Val 
195 200 205 

gtt gtt gtt aga get cca cat att caa gaa aat aaa atg aga tgg ggt 672 
10 Val Val Val Arg Ala Pro His lie Gin Glu Asn Lys Met Arg Trp Gly 
210 215 220 

aaa gac tat get act tat att att cca aga cca tat tct aat ggt gaa 72 0 

Lys Asp Tyr Ala Thr Tyr lie lie Pro Arg Pro Tyr Ser Asn Gly Glu 
15 225 230 235 240 

tta gtc tta ggt ggt ttc tta caa aag gat aat tgg aca ggt aat act 768 

Leu Val Leu Gly Gly Phe Leu Gin Lys Asp Asn Trp Thr Gly Asn Thr 

245 250 255 

20 

ttt ggt ttt gaa act gat gat att gtt agt aga act aca tct tta tta 816 

Phe Gly Phe Glu Thr Asp Asp lie Val Ser Arg Thr Thr Ser Leu Leu 

260 265 270 

25 cca aag att tta gat gaa cca ctt cat att att aga gtt gca get ggt 864 
Pro Lys lie Leu Asp Glu Pro Leu His lie lie Arg Val Ala Ala Gly 
275 280 285 

tta aga cca agt aga eat ggt ggt cca aga att gaa get gaa gtt tgt 912 
30 Leu Arg Pro Ser Arg His Gly Gly Pro Arg He Glu Ala Glu Val Cys 

290 295 300 

gaa gaa ggt aaa tta act att cat aat tat ggt get tct gga tat ggt 960 
Glu Glu Gly Lys Leu Thr He His Asn Tyr Gly Ala Ser Gly Tyr Gly 
35 305 310 315 320 

« 

tat caa get ggt tat ggt atg tct tat gaa get gtc aaa ctt tta gtt 1008 

Tyr Gin Ala Gly Tyr Gly Met Ser Tyr Glu Ala Val Lys Leu Leu Val 

325 330 335 



40 



gat aac caa aaa gtt aaa get aaa ctt tag 103 8 

Asp Asn Gin Lys Val Lys Ala Lys Leu 

340 345 



45 
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<210> 14 
<211> 345 
<212> PRT 

<213> Candida boidinii 

5 

<400> 14 

Met Gly Asp Gin lie Val Val Leu Gly Ser Gly lie lie Gly Leu Tyr 
15 10 15 

10 Thr Thr Tyr Cys Leu lie Tyr Glu Ala Gly Cys Ala Pro Ala Lys lie 

20 25 30 

Thr lie Val Ala Glu Phe Leu Pro Gly Asp Gin Ser Thr Leu Tyr Thr 
35 40 45 

15 

Ser Pro Trp Ala Gly Gly Asn Phe Ser Cys lie Ser Pro Ala Asp Asp 
50 55 60 

Thr Thr Leu Ala Tyr Asp Lys Phe Thr Tyr Leu Asn Leu Phe Lys lie 
20 65 70 75 80 

His Lys Lys Leu Gly Gly Pro Glu Cys Gly Leu Asp Asn Lys Pro Ser 

85 90 95 

25 Thr Glu Tyr Trp Asp Phe Tyr Pro Gly Asp Glu Lys Val Asn Ser Leu 

. 100 105 110 

Lys Gin Tyr Leu Lys Asp Phe Lys Val lie Pro Lys Ser Glu Leu Pro 
115 120 125 

30 

Glu Gly Val Glu Tyr Gly lie Ser Tyr Thr Thr Trp Asn Phe Asn Cys 
130 135 140 

Pro Val Phe Leu Gin Asn Met Ala Asn Phe Leu Asn Lys Arg Asn Val 
35 145 150 155 160 

Thr lie lie Arg Lys His Leu Thr His lie Ser Gin Ala Tyr Leu Thr 

165 170 175 

40 Val Asn Thr Lys Val Val Phe Asn Cys Thr Gly lie Gly Ala Ala Asp 

180 185 190 



Leu Gly Gly Val Lys Asp Glu Lys Val Tyr Pro Thr Arg Gly Gin Val 
195 200 205 

45 
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Val Val Val Arg Ala Pro His lie Gin Glu Asn Lys Met Arg Trp Gly 
210 215 . 220 

Lys Asp Tyr Ala Thr Tyr lie lie Pro Arg Pro Tyr Ser Asn Gly Glu 
225 230 235 240 



Leu Val Leu Gly Gly Phe Leu Gin Lys Asp Asn Trp Thr Gly Asn Thr 

245 250 255 

Phe Gly Phe Glu Thr Asp Asp lie Val Ser Arg Thr Thr Ser Leu Leu 

260 265 270 

Pro Lys lie Leu Asp Glu Pro Leu His lie He Arg Val Ala Ala Gly 
275 280 285 

Leu Arg Pro Ser Arg His Gly Gly Pro Arg He Glu Ala Glu Val Cys 
290 295 300 

Glu Glu Gly Lys Leu Thr He His Asn Tyr Gly Ala Ser Gly Tyr Gly 
305 310 315 320 

Tyr Gin Ala Gly Tyr Gly Met Ser Tyr Glu Ala Val Lys Leu Leu Val 

325 330 335 

Asp Asn Gin Lys Val Lys Ala Lys Leu 

340 345 



<210> 15 

<211> 12466 

<212> DNA 

<213> vector daaoScelTetON 
<220> 

<221> misc_feature 

<222> (38) . . (183) 

<223> Agrobacterium right border 
<220> 

<221> misc_f eature 

<222> (445) . . (462) 

<223> recognition / cleavage site for I-Scel endonuclease 



10 



15 



20 



25 
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<220> 

<221> terminator 

<222> (196) . . (400) 

<22 3> complementary: 3 5S terminator 
<220> 

<221> misc_feature 

<222> (515) . . (1222) 

<223> complementary: coding for I-Scel endonuclease 
<220> 

<221> promoter 

<222> (1270) . . (1660) 

<223> complementary: coding for pTOPlOP teracyclin regulatable promoter 
<220> 

<221> terminator 

<222> (1735) . . (1990) 

<223> complementary: NOS terminator 
<220> 

<221> misc_feature 

<222> (2067) . . (3173) 

<223> complementary: coding for Rhodotorrula gracilis D-amino acid oxida 
se 



<220> 

<221> promoter 

<222> (3217) . . (5028) 

30 <223> complementary: Arabidopsis thaliana nitrilase I promoter 

<220> 

<221> terminator 

<222> (5118) . . (5343) 

35 <223> complementary: OCS terminator 

<220> 

<221> misc__f eature 

<222> (5418) . . (6425) 

40 <223> complementary: coding for tetracyclin repressor rtTA 

<220> 

<221> promoter 

<222> (6479) . . (7341) 

45 <223> complementary: coding for Pisum sativum ptxA promoter 
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30 

<220> 

<221> terminator 
<222> (7345) . . (7549) 

<223> complementary: coding for 23S terminator (functioning as homology 



<220> 

<221> misc_feature 
<222> (7618) . . (7834) 
10 <223> Agrobacterium left border 

<400> 15 

aatattcaaa caaacacata cagcgcgact tatcatggac atacaaatgg acgaacggat 60 

aaaccttttc acgccctttt aaatatccga ttattctaat aaacgctctt ttctcttagg 120 

15 tttacccgcc aatatatcct gtcaaacact gatagtttaa actgaaggcg ggaaacgaca 180 

atcagatctg gtacccggtc actggatttt ggttttagga attagaaatt ttattgatag 240 

aagtatttta caaatacaaa tacatactaa gggtttctta tatgctcaac acatgagcga 3 00 

aaccctataa gaaccctaat tcccttatct gggaactact cacacattat tctggagaaa 360 

aatagagaga gatagatttg tagagagaga ctggtgattt ttgcgccggg taccccaaac 42 0 

20 tgtctcacga cgttttgaac ccagattacc ctgttatccc tagtcgagcg gccgccagtg 480 

tgatggatat ctgcagaatt cgccctttta gatcttattt caggaaagtt tcggaggaga 540 

tagtgttcgg cagtttgtac atcatctgcg ggatcaggta cggtttgatc aggttgtaga 600 

agatcaggta agacatagaa tcgatgtaga tgatcggttt gtttttgttg atttttacgt 660 

aacagttcag ttggaatttg ttacgcagac ccttaaccag gtattctact tcttcgaaag ■ 720 

25 tgaaagactg ggtgttcagt acgatcgatt tgttggtaga gtttttgttg taatcccatt 780 

taccaccatc atccatgaac cagtatgcca gagacatcgg ggtcaggtag ttttcaacca 840 

ggttgttcgg gatggttttt ttgttgttaa cgatgaacag gctagccagt ttgttgaaag 900 

cttggtgttt gaaagtctgg gcgccccagg tgattaccag gttacccagg tggttaacac 960 

gttctttttt gtgcggcggg gacagtaccc actgatcgta cagcagacat acgtggtcca 102 0 

30 tgtatgcttt gtttttccac tcgaactgca tacagtaggt tttaccttca tcacgagaac 1080 

ggatgtaagc atcacccagg atcagaccga tacctgcttc gaactgttcg atgttcagtt 1140 

cgatcagctg ggatttgtat tctttcagca gtttagagtt cggacccagg ttcattacct 12 00 

ggtttttttt gatgtttttc atatggtcga ctaaagggcg aattccagca cactggcggc 1260 

cgttactagc ccgggctcga gcaaatgtct agaaaggcct tatatacgta aagggtcttg 132 0 

35 cgaagactag atcactctat ctcgagttta ccactcccta tcagtgatag agaaaagtga 1380 

aagtcgagtt taccactccc tatcagtgat agagaaaagt gaaagtcgag tttaccactc 1440 

cctatcagtg atagagaaaa gtgaaagtcg agtttaccac tccctatccg tgatagagaa 1500 

aagtgaaagt cgagtttacc actccctatc agtgatagag aaaagtgaaa gtcgagttta 1560 

ccactcccta tcagtgatag agaaaatgaa agtcgagttt accactccct atcagtgata 1620 

40 gagaaaagtg aaagtcgagc tcggtaccga gctcgaattc agcacactgg cggccgttac 1680 

tagtggatca attcactggc cgtcgtttta caacgactca gagcttgaca ggaggcccga 174 0 

tctagtaaca tagatgacac cgcgcgcgat aatttatcct agtttgcgcg ctatattttg 1800 

ttttctatcg cgtattaaat gtataattgc gggactctaa tcataaaaac ccatctcata 1860 

aataacgtca tgcattacat gttaattatt acatgcttaa cgtaattcaa cagaaattat 1920 

45 atgataatca tcgcaagacc ggcaacagga ttcaatctta agaaacttta ttgccaaatg 1980 
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tttgaacgat cggggatcat ccgggtctgt 
agcaagatct agagcttggg tcccgcctac 
cgctggaacg cctcgtcgac gagctgcgcg 
tatcccgcac tcgagaagcc atacgcatgc 
5 cgtgcgctgc ccctgccgag cgagaggggc 
cgttctgcct caacgcgggg tccgcctcgt 
aggacctcga tgccttcgat cgttccgtcg 
tgcttgagga tccgctggac cgtctctggg 
tacgtcccgc cgcagatgac ttcgccacct 

10 gggtcggacg agtccatcgt gcatcgcttg 
cggattggct cggcggcttg gtcgtcgatg 
gcgttgacca ccaaatccgc accgtcgaac 
tcaaacgtcg cgccgagctt ctgcagctct 
tggacggaga gggtgtcgta ggttacgccg 

15 agggggcggt aatttggcgt gatgtccttg 
tgcgcgaacc gcctcgtccc cttgagccac 
cacttcttga aagtcgattc ttcccatttt 
aaaggcgtcc aattcgcgcc agcccatggt 
ggcaagtcgc gcgcgagaat atgcacgctg 

20 ctcagaccga taacgcctga tccgaggaca 
cgactagagt cgagatccga tatcgcccgg 
tgacactcag agacttgaga gaagcaatat 
cacgtttatt ttcctattgg agacggtaac 
aggtgggact agcccacgtg gtttcttttc 

25 aaagttcaca tcaaacctac taataattga 
accagccgat gctttatgtc tgaatataaa 
aacggtggtt tacaaagtat ttggacacaa 
aactaccaaa taaaaaacaa ataaaaaact 
agaatttatt cttttatgga ttccttaaat 

30 taggtaataa tatatactat gttatagata 
ttggtcgtac gattctacct aataatattg 
agtatttttt ttttggtttg tcttagctgt 
tgtcataaat aaaatttgac gtatagattt 
gaaatgtgag tttggctcct atttaatctt 

35 gaattatctc agagaaactg gattaaataa 
ggtcaaatat ctttggattt actattattg 
attcctacaa gagaaatcaa aatttttgaa 
ccaattagtt cgcatcaaga aaaccaaacc 
atatttcatt gcagctttca gataaaagaa 

40 ctttcttact tgcaggaact caaattccac 
attgtcaatg aaacgaaata gtctttttat 
tttctgattg tgacagccat ccatatatat 
acatatacgt aatggtttag catagcttct 
acatttttaa gaaaaataaa ttaatatatg 

45 acagtaactt aattattaaa cattctaaat 
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Qqcqggaact 


ccacgaaaat 


atccgaacgc 


2040 


aact:t:cgact 


cccgcgccgc 


gccgtggtac 


2100 


acatcctccq 


ccgcgcccca 


actctgctgg 


2160 


acaagcgtga 


cctccttctc 


cttcgccgct 


2220 


gaGtttgtcc 


ggtcgagagg 


caggacgatc 


2280 


cgtgcaggtc 


gcaagccgac 


gttgtggcgg 


2340 


ctcgagatgg 


tcgggtcgag 


gcgcaagcag 


2400 


ttgacagaca 


agtcccagtc 


tcccacgccg 


2460 


ggtcggggaa 


tgatgtaggc 


gggagaagcg 


2520 


catggggact 


tgacgaggac 


ggtttgcccg 


2580 


cccgcaatcg 


acttggcgcc 


aagtcccgta 


2640 


gcctgctcaa 


gcgacgtaac 


ggtccgtctc 


2700 


cttgcaaggt 


actggcagta 


ctttggtgcg 


2760 


atagcgccag 


gtggacattc 


ggaagatggg 


2820 


taccagtgcc 


cgagcaagcc 


gtcttcgttc 


2880 


atggcatggc 


ccgtcgggac 


caactcgacc 


2940 


gcttgtcgag 


gaccgtctgt 


aagcgtcatg 


3000 


gaagcgaaag 


tctggctcga 


gacgtcctcc 


3060 


tagcccttcc 


qaqcgaqqat 


qaqqqcqctq 


3120 


acQacQCQct. 


tctqcqaqtq 


catqqqccct 


3180 


qctcgagtct 


ttgtttttta 


ctttggttca 


3240 


ataQacbttz.t 


ttttqttttt 


tttttqtqqt 


3300 


gaagatcgaa 


CCtqtqqtqq 


aaatgaaaca 


3360 


tctgcattga 


tttgtttttg 


ttttttttgt 


3420 


gaagaaaaat 


aaaatctatt 


qat tqattaa 


3480 


aaagaagt:ga 


aaaccccgtt 


taagaatztac 


3540 


tiaaaticcaaa 


cqaaataaaa 


caaaatggag 


3600 


taaaaQaatt: 


tattccattt 
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tttttcccgt 


3660 


ccatatttga 


tgcattttga 


ttcctcataa 


3720 


tgtttctaat 


tcgtattaac 


ctaccttttt 


3780 


aacggaattg 


atgttttgga 


ccacttagaa 


3840 


atttcattaa 


atataaattt 


aaataagaaa 


3900 


t:t.t:aaat.cca 


ttttatgtta 


tttaatattt 


3960 


aQQatcfQQtt 


aatactaagt 


tttccttaat 


4020 


actaaaaaat 
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4080 


Qcqaaaaqaa 


agtctcatat 


aqtaaatcat 


4140 


ttaacatQcra 


t tQtataQtt 


tcttatataa 


4200 


cc aatitaatia 
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4440 


agggaatgta 
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4500 


attttcgttg 
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tcattccaaa 


4560 


tatattcttg 


gaactaatgt 


atgtggaaat 


4620 


gcaaatatgc 


aaagaaaaaa 


aagaaaagaa 


4680 
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cacaactgaa atcaaagcca gattcataat 
aacacaacat ccagaattga acactcaaat 
agagaattgg ttctcttcca ttattaacga 
ttgaaactca acataacatt ttgaacttga 
5 atcacaaccg attacttgtt tgagtcatct 
accggtttct tcgggtgctc atttacggtc 
ggtagggata acagggtaat ctgggttcaa 
aattcgagct cggtaccaat tcccatcttg 
aaataacaag tcaggtatta tagtccaagc 

10 attcagaaat atttcaataa ctgattatat 
gagtgcgata ttatgtgtaa tacataaatt 
atcttgcgcc gggtaccgag ctcggtagca 
ccaggagagt tgttgatcta cccaccgtac 
tgctcaaact cgaagtcggc catatccaga 

15 aatcccggac ccggggaatc cccgtccccc 
tcggcatgcg ccatcgccac gtcctcgccg 
tcggtcgggg gggccgtcga cagtctgcgc 
ggagccgcca gccccgcctc ttcgggggcg 
atggtagacc cgtaattgtt tttcgtacgc 

20 ttaagttgtt tttctaatcc gcatatgatc 
gcaccttggt gatcaaataa ttcgatagct 
gtaggtgttt ccctttcttc tttagcgact 
aaagtaaaat gccccacagc gctgagtgca 
tggcataaaa aggctaattg attttcgaga 

25 tctgaatgta cttttgctcc attgcgatga 
ttattgcgta aaaaatcttg ccagctttcc 
ctatctaaca tctcaatggc taaggcgtcg 
tacagtgtag gctgctctac accaagcttc 
attccgacct cattaagcag ctctaatgcg 

30 gacatggtcg atcgactcta gactagtgga 
ttcgaagatt ttagtgtaat gtgtgtgctc 
agaatgagtg atgaggttta tatggtgaaa 
tctttctcgt gagtcatatt cacggaccat 
tttttatgtt aaatcattga ttgattttta 

35 atgaattcaa acaaactagg ccagaaaaaa 
aatttgttta tttttttcat ggataagggg 
gacatttttt taacatgtag tgacaagtag 
aatcttaatc tttttttcat aaatctgcac 
gtgctaaact attggtaatt tctttttgta 

40 gaaatgtgtc atcctaaaaa caacattttg 
aatggaagaa gcaatatgtt gatgtttatt 
taattgacaa ctatgatttt atattggatt 
ggtttcaaag catgttatgc tagtgattct 
tccttgaatg gagggacaaa ctttgaatga 

45 ctcagcttca caaaaaattg cggacggtca 
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aattggctac 


atggttgtag 


aatgtagggt 


4740 


tggatgatag 


atggataatc 


tttagataca 


4800 


aaataaagaa 


aaaaagttta 


gcataaaagt 


4860 


ctccttcata 


99agtgacat 


gaactgacga 


4920 


tccgctttct 


ccaccttcga 


aatgaatgtg 


4980 


aagtgtaaaa 


catctggtct 


cgaggtacct 


5040 


aacgtcgtga 


gacagtttgg 


tgcaggtcga 


5100 


aaagaaatat 


agtttaaata 


tttattgata 


5160 


aaaaacataa 


atttattgat 


gcaagtttaa 


5220 


cagctggtac 


attgccgtag 


atgaaagact 


5280 


gatgatatag 


ctagcttagc 


tcatcggggg 


5340 


attcccgagg 


ctgtagccga 


cgatggtgcg 


5400 


tcgtcaattc 


caagggcatc 


ggtaaacatc 


5460 


gcgccgtagg 


999cggagtc 


gtggggggta 


5520 


aacatgtcca 


gatcgaaatc 


gtctagcgcg 


5580 


tctaagtgga 


gctcgtcccc 


caggctgaca 


5640 


gtgtgtcccg 


cggggagaaa 


ggacaggcgc 


5700 


tcgtcgtccg 


ggagatcgag 


caggccctcg 


5760 


gcgcggctgt 


acgcgqaccc 


actttcacat 


5820 
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cgaataagaa 


ggctggctct 


5880 


tgtcgtaata 


atggcggcat 


actatcagta 


5940 


tgatgctctt 


gatcttccaa 


tacgcaacct 


6000 


tataacgcgt 


tctctagtga 


aaaaccttgt 


6060 


gtttcatact 


gtttttctgt 


aggccgtgta 


6120 


cttagtaaaa 


cacatctaaa 


acttttagcg 


6180 


ccttttaaag 


ggcaaaagtg 


agtatggtgc 


6240 


agcaaagccc 


gcttattttt 


tacatgccaa 


6300 


tgggcgagtt 


tacgggttgt 


taaaccttcg 


6360 


ctgttaatca 


ctttactttt 


atctaatcta 


6420 
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6480 
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aattttgata 
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6660 


gtgggcctcg 
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6720 
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gtgtcttaag 
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tattgataga agtattttac aaatacaaat acatactaag ggtttcttat atgctcaaca 7440 

catgagcgaa accctataag aaccctaatt cccttatctg ggaactactc acacattatt 7500 

ctggagaaaa atagagagag atagatttgt agagagagac tggtgatttt ccgggggatc 7560 

ctctagagtc gaggtaccga gctcgaattc actggccgtc gttttacaac gactcagtac 7620 

5 tgcttggtaa taattgtcat tagattgttt ttatgcatag atgcactcga aatcagccaa 7680 

ttttagacaa gtatcaaacg gatgttaatt cagtacatta aagacgtccg caatgtgtta 7740 

ttaagttgtc taagcgtcaa tttgtttaca ccacaatata tcctgccacc agccagccaa 7800 

cagctccccg accggcagct cggcacaaaa tcaccacgcg tctaaaaagg tgatgtgtat 7860 

ttgagtaaaa cagcttgcgt catgcggtcg ctgcgtatat gatgcgatga gtaaataaac 7920 

10 aaatacgcaa ggggaacgca tgaaggttat cgctgtactt aaccagaaag gcgggtcagg 7980 

caagacgacc atcgcaaccc atctagcccg cgccctgcaa ctcgccgggg ccgatgttct 8040 

gttagtcgat tccgatcccc agggcagtgc ccgcgattgg gcggccgtgc gggaagatca 8100 

accgctaacc gttgtcggca tcgaccgccc gacgattgac cgcgacgtga aggccatcgg 8160 

ccggcgcgac ttcgtagtga tcgacggagc gccccaggcg gcggacttgg ctgtgtccgc 8220 

15 gatcaaggca gccgacttcg tgctgattcc ggtgcagcca agcccttacg acatatgggc 8280 

caccgccgac ctggtggagc tggttaagca gcgcattgag gtcacggatg gaaggctaca 834 0 

agcggccttt gtcgtgtcgc gggcgatcaa aggcacgcgc atcggcggtg aggttgccga 8400 

ggcgctggcc gggtacgagc tgcccattct tgagtcccgt atcacgcagc gcgtgagcta 8460 

cccaggcact gccgccgccg gcacaaccgt tcttgaatca gaacccgagg gcgacgctgc 8520 

20 ccgcgaggtc caggcgctgg ccgctgaaat taaatcaaaa ctcatttgag ttaatgaggt 85 8 0 

aaagagaaaa tgagcaaaag cacaaacacg ctaagtgccg gccgtccgag cgcacgcagc 864 0 

agcaaggctg caacgttggc cagcctggca gacacgccag ccatgaagcg ggtcaacttt 8700 

cagttgccgg cggaggatca caccaagctg aagatgtacg cggtacgcca aggcaagacc 8760 

attaccgagc tgctatctga atacatcgcg cagctaccag agtaaatgag caaatgaata 8820 

25 aatgagtaga tgaattttag cggctaaagg aggcggcatg gaaaatcaag aacaaccagg 8880 

caccgacgcc gtggaatgcc ccatgtgtgg aggaacgggc ggttggccag gcgtaagcgg 894 0 

ctgggttgtc tgccggccct gcaatggcac tggaaccccc aagcccgagg aatcggcgtg 9000 

agcggtcgca aaccatccgg cccggtacaa atcggcgcgg cgctgggtga tgacctggtg 9060 

gagaagttga aggccgcgca ggccgcccag cggcaacgca tcgaggcaga agacgccccg 912 0 

30 gtgaatcgtg gcaaggggcc gctgatcgaa tccgcaaaga atcccggcaa ccgccggcag 9180 

ccggtgcgcc gtcgattagg aagccgccca agggcgacga gcaaccagat tttttcgttc 924 0 

cgatgctcta tgacgtgggc acccgcgata gtcgcagcat catggacgtg gccgttttcc 9300 

gtctgtcgaa gcgtgaccga cgagctggcg aggtgatccg ctacgagctt ccagacgggc 9360 

acgtagaggt ttccgcaggg ccggccggca tggccagtgt gtgggattac gacctggtac 9420 

35 tgatggcggt ttcccatcta accgaatcca tgaaccgata ccgggaaggg aagggagaca 94 80 

agcccggccg cgtgttccgt ccacacgttg cggacgtact caagttctgc cggcgagccg 9540 

atggcggaaa gcagaaagac gacctggtag aaacctgcat tcggttaaac accacgcacg 9600 

ttgccatgca gcgtacgaag aaggccaaga acggccgcct ggtgacggta tccgagggtg 9660 

aagccttgat tagccgctac aagatcgtaa agagcgaaac cgggcggccg gagtacatcg 9720 

40 agatcgagct agctgattgg atgtaccgcg agatcacaga aggcaagaac ccggacgtgc 978 0 

tgacggttca ccccgattac tttttgatcg atcccggcat cggccgtttt ctctaccgcc 9840 

tggcacgccg cgccgcaggc aaggcagaag ccagatggtt gttcaagacg atctacgaac 9900 

gcagtggcag cgccggagag ttcaagaagt tctgtttcac cgtgcgcaag ctgatcgggt 9960 

caaatgacct gccggagtac gatttgaagg aggaggcggg gcaggctggc ccgatcctag 1002 0 

45 tcatgcgcta ccgcaacctg atcgagggcg aagcatccgc cggttcctaa tgtacggagc 10080 
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agatgctagg gcaaattgcc ctagcagggg aaaaaggtcg aaaaggtctc tttcctgtgg 10140 

atagcacgta cattgggaac ccaaagccgt acattgggaa ccggaacccg tacattggga 10200 

acccaaagcc gtacattggg aaccggtcac acatgtaagt gactgatata aaagagaaaa 10260 

aaggcgattt ttccgcctaa aactctttaa aacttattaa aactcttaaa acccgcctgg 10320 

5 cctgtgcata actgtctggc cagcgcacag ccgaagagct gcaaaaagcg cctacccttc 10380 

ggtcgctgcg ctccctacgc cccgccgctt cgcgtcggcc tatcgcggcc tatgcggtgt 1044 0 

gaaataccgc acagatgcgt aaggagaaaa taccgcatca ggcgctcttc cgcttcctcg 10500 

ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag 10560 

gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa 1062 0 

10 ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc 1068 0 

cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca 1074 0 

ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg 10800 

accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct 1086 0 

catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt 10920 

15 gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag 10980 

tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc 1104 0 

agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac 11100 

actagaagga cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga 1116 0 

gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc 11220 

20 aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg 112 80 

gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gcatgatata 1134 0 

tctcccaatt tgtgtagggc ttattatgca cgcttaaaaa taataaaagc agacttgacc 11400 

tgatagtttg gctgtgagca attatgtgct tagtgcatct aacgcttgag ttaagccgcg 11460 

ccgcgaagcg gcgtcggctt gaacgaattt ctagctagac attatttgcc gactaccttg 1152 0 

25 gtgatctcgc ctttcacgta gtggacaaat tcttccaact gatctgcgcg cgaggccaag 11580 

cgatcttctt cttgtccaag ataagcctgt ctagcttcaa gtatgacggg ctgatactgg 1164 0 

gccggcaggc gctccattgc ccagtcggca gcgacatcct tcggcgcgat tttgccggtt 11700 

actgcgctgt accaaatgcg ggacaacgta agcactacat ttcgctcatc gccagcccag 11760 

tcgggcggcg agttccatag cgttaaggtt tcatttagcg cctcaaatag atcctgttca 1182 0 

30 ggaaccggat caaagagttc ctccgccgct ggacctacca aggcaacgct atgttctctt 11880 

gcttttgtca gcaagatagc cagatcaatg tcgatcgtgg ctggctcgaa gatacctgca 1194 0 

agaatgtcat tgcgctgcca ttctccaaat tgcagttcgc gcttagctgg ataacgccac 12000 

ggaatgatgt cgtcgtgcac aacaatggtg acttctacag cgcggagaat ctcgctctct 12060 

ccaggggaag ccgaagtttc caaaaggtcg ttgatcaaag ctcgccgcgt tgtttcatca 1212 0 

35 agccttacgg tcaccgtaac cagcaaatca atatcactgt gtggcttcag gccgccatcc 12180 

actgcggagc cgtacaaatg tacggccagc aacgtcggtt cgagatggcg ctcgatgacg 12240 

ccaactacct ctgatagttg agtcgatact tcggcgatca ccgcttcccc catgatgttt 12300 

aactttgttt tagggcgact gccctgctgc gtaacatcgt tgctgctcca taacatcaaa 12360 

catcgaccca cggcgtaacg cgcttgctgc ttggatgccc gaggcataga ctgtacccca 124 20 

40 aaaaaacagt cataacaagc catgaaaacc gccactgcgt tccatg 12466 
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<213> daaoNit-PRecombination 



<220> 

5 <221> misc_feature 

<222> (38) . . (183) 

<223> Agrobacterium right border 
<220> 

10 <221> terminator 

<222> (196) . . (400) 

<223> complementary: 3 5S terminator 
<220> 

15 <221> misc_feature 

<222> (445) . . (462) 

<223> cleavage / recognition site for I-Scel endonuclease 
<220> 

20 <221> terminator 

<222> (589) . . (844) 

<223> complementary: nos terminator 
<220> 

25 <221> misc_feature 

<222> (921) . . (2027) 

<22 3> complementary: coding for Rhodotorula gracilis D-amino acid oxida 
se 

30 <220> 

<221> promoter 

<222> (2071) . . (3882) 

<223> complementary: A-thaliana nitrilase I promoter 

35 <220> 

<221> terminator 

<222> (3972) . . (4176) 

<223> complementary: 35S terminator 

40 <220> 

<221> miscfeature 

<222> (4251) . . (6248) 

<223> complementary: coding for beta -glucuronidase 
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<220> 

<221> promoter 

<222> (6302) . . (7619) 

<223> complementar . coding for sTPT promoter 
<220> 

<221> misc_feature 

<222> (7691) . . (7907) 

<223> Agrobacterium left border 



<400> 16 

aatattcaaa caaacacata cagcgcgact tatcatggac atacaaatgg acgaacggat 60 

aaaccttttc acgccctttt aaatatccga ttattctaat aaacgctctt ttctcttagg 120 

tttacccgcc aatatatcct gtcaaacact gatagtttaa actgaaggcg ggaaacgaca 180 

15 atcagatctg gtacccggtc actggatttt ggttttagga attagaaatt ttattgatag 240 

aagtatttta caaatacaaa tacatactaa gggtttctta tatgctcaac acatgagcga 300 

aaccctataa gaaccctaat tcccttatct gggaactact cacacattat tctggagaaa 360 

aatagagaga gatagatttg tagagagaga ctggtgattt ttgcgccggg taccccaaac 420 

tgtctcacga cgttttgaac ccagattacc ctgttatccc tagtcgagcg gccgccagtg 480 

20 tgatggatat ctgcagaatt cgccctttta gatcagcaca ctggcggccg ttactagtgg 54 0 

atcaattcac tggccgtcgt tttacaacga ctcagagctt gacaggaggc ccgatctagt 600 

aacatagatg acaccgcgcg cgataattta tcctagtttg cgcgctatat tttgttttct 660 

atcgcgtatt aaatgtataa ttgcgggact ctaatcataa aaacccatct cataaataac 720 

gtcatgcatt acatgttaat tattacatgc ttaacgtaat tcaacagaaa ttatatgata 780 

25 atcatcgcaa gaccggcaac aggattcaat cttaagaaac tttattgcca aatgtttgaa 840 

cgatcgggga tcatccgggt ctgtggcggg aactccacga aaatatccga acgcagcaag 900 

atctagagct tgggtcccgc ctacaacttc gactcccgcg ccgcgccgtg gtaccgctgg 960 

aacgcctcgt cgacgagctg cgcgacatcc tccgccgcgc cccaactctg ctggtatccc 1020 

gcactcgaga agccatacgc atgcacaagc gtgacctcct tctccttcgc cgctcgtgcg 1080 

30 ctgcccctgc cgagcgagag gggcgacttt gtccggtcga gaggcaggac gatccgttct 1140 

gcctcaacgc ggggtccgcc tcgtcgtgca ggtcgcaagc cgacgttgtg gcggaggacc 1200 

tcgatgcctt cgatcgttcc gtcgctcgag atggtcgggt cgaggcgcaa gcagtgcttg 1260 

aggatccgct ggaccgtctc tgggttgaca gacaagtccc agtctcccac gccgtacgtc 132 0 

ccgccgcaga tgacttcgcc acctggtcgg ggaatgatgt aggcgggaga agcggggtcg 13 80 

35 gacgagtcca tcgtgcatcg cttgcatggg gacttgacga ggacggtttg cccgcggatt 144 0 

ggctcggcgg cttggtcgtc gatgcccgca atcgacttgg cgccaagtcc cgtagcgttg 1500 

accaccaaat ccgcaccgtc gaacgcctgc tcaagcgacg taacggtccg tctctcaaac 1560 

gtcgcgccga gcttctgcag ctctcttgca aggtactggc agtactttgg tgcgtggacg 1620 

gagagggtgt cgtaggttac gccgatagcg ccaggtggac attcggaaga tgggaggggg 1680 

40 cggtaatttg gcgtgatgtc cttgtaccag tgcccgagca agccgtcttc gttctgcgcg 1740 

aaccgcctcg tccccttgag ccacatggca tggcccgtcg ggaccaactc gacccacttc 1800 

ttgaaagtcg attcttccca ttttgcttgt cgaggaccgt ctgtaagcgt catgaaaggc 1860 

gtccaattcg cgccagccca tggtgaagcg aaagtctggc tcgagacgtc ctccggcaag 1920 

tcgcgcgcga gaatatgcac gctgtagccc ttccgagcga ggatgagggc gctgctcaga 1980 

45 ccgataacgc ctgatccgag gacaacgacg cgcttctgcg agtgcatggg ccctcgacta 2040 
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gagtcgagat ccgatatcgc ccgggctcga 
tcagagactt gagagaagca atatatagac 
tattttccta ttggagacgg taacgaagat 
gactagccca cgtggtttct tttctctgca 
5 cacatcaaac ctactaataa ttgagaagaa 
cgatgcttta tgtctgaata taaaaaagaa 
ggtttacaaa gtatttggac acaataaatc 
caaataaaaa acaaataaaa aacttaaaag 
tattctttta tggattcctt aaatccatat 

10 ataatatata ctatgttata gatatgtttc 
gtacgattct acctaataat attgaacgga 
ttttttttgg tttgtcttag ctgtatttca 
aaataaaatt tgacgtatag attttttaaa 
tgagtttggc tcctatttaa tcttaggatg 

15 tctcagagaa actggattaa ataaactaaa 
atatctttgg atttactatt attggcgaaa 
acaagagaaa tcaaaatttt tgaattaaca 
agttcgcatc aagaaaacca aaccccaatt 
cattgcagct ttcagataaa agaaaaaaac 

20 tacttgcagg aactcaaatt ccactttgcc 
aatgaaacga aatagtcttt ttatgcaaac 
attgtgacag ccatccatat atatagggaa 
acgtaatggt ttagcatagc ttctattttc 
ttaagaaaaa taaattaata tatgtatatt 

25 acttaattat taaacattct aaatgcaaat 
tgaaatcaaa gccagattca taataattgg 
acatccagaa ttgaacactc aaattggatg 
ttggttctct tccattatta acgaaaataa 
ctcaacataa cattttgaac ttgactcctt 

30 accgattact tgtttgagtc atcttccgct 
ttcttcgggt gctcatttac ggtcaagtgt 
gataacaggg taatctgggt tcaaaacgtc 
agctcggtac ccggtcactg gattttggtt 
attttacaaa tacaaataca tactaagggt 

35 ctataagaac cctaattccc ttatctggga 
gagagagata gatttgtaga gagagactgg 
gcaattcccg aggctgtagc cgacgatggt 
cctccctgct gcggtttttc accgaagttc 
gccgccgact tcggtttgcg gtcgcgagtg 

40 aatatgcctt gcgaggtcgc aaaatcggcg 
ctgacgcgat caaagacgcg gtgatacata 
cacatgtcgg tgtacattga gtgcagcccg 
ataatcggct gatgcagttt ctcctgccag 
gccgtttcca aatcgccgct ttggacatac 

45 tcaaagagat cgctgatggt atcggtgtga 
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gtctttgttt 


tttactttgg 


ttcatgacac 


2100 


ttttttttgt 


tttttttttg 


tggtcacgtt 


2160 


cgaacctgtg 


gtggaaatga 


aacaaggtgg 


2220 


ttgatttgtt 


tttgtttttt 


ttgtaaagtt 


2280 


aaataaaatc 


tattgattga 


ttaaaccagc 


2340 


gtaaaaaccc 


cgtttaagaa 


ttacaacggt 


2400 


caaacgaaat 


aaaacaaaat 


ggagaactac 


2460 


aatttattcc 


attttttttc 


ccgtagaatt 


2520 


ttgatgcatt 


ttgattcctc 


ataataggta 


2580 


taattcgtat 


taacctacct 


ttttttggtc 


2640 


attgatgttt 


tggaccactt 


agaaagtatt 


2700 


ttaaatataa 


atttaaataa 


gaaatgtcat 


2760 


tccattttat 


gttatttaat 


atttgaaatg 


2820 


ggttaatact 


aagttttcct 


taatgaatta 


2880 


aaataqatca 


atgtqttttg 


gtccggtcaa 


2940 


agaaagtctc 


atatagtaaa 


tcatattcct 


3000 


tggattgtat 


agtttcttat 


ataaccaatt 


3060 


aataatcaaa 


cgggcttggt 


aggaatattt 


.3120 


acacactcaa 


gtcttttatt 


tcatctttct 


3180 


acttttcttt 


acaaataaac 


acaaattgtc 


3240 


actgtttgtc 


ttttttcgat 


cacgtttctg 


3300 


tgtaaaacaa 


caacatgtga 


agtcacatat 


3360 


gttgtcaata 


ttagtcattc 


caaaacattt 


3420 


cttggaacta 


atgtatgtgg 


aaatacagta 


3480 


atqcaaagaa 


aaaaaagaaa 


agaacacaac 


3540 


ctacatggtt 


gtagaatgta 


gggtaacaca 


3600 


atagatggat 


aatctttaga 


tacaagagaa 


3660 


agaaaaaaag 


tttagcataa 


aagtttgaaa 


3720 


cataggagtg 


acatgaactg 


acgaatcaca 


3780 


ttctccacct 


tcgaaatgaa 


tgtgaccggt 


3840 


aaaacatctg 


gtctcgaggt 


acctggtagg 


3900 


gtgagacagt 


ttggtgcagg 


tcgaaattcg 


3960 


ttaggaatta 


gaaattttat. 


tgatagaagt 


4020 


ttcttatatg 


ctcaacacat 


gagcgaaacc 


4080 


actactcaca 


cattattctg 


gagaaaaata 


4140 


tgatttttgc 


gccgggtacc 


gagctcggta 


4200 


gcgccaggag 


agttgttgat 


tcattgtttg 


4260 


atgccagtcc 


agcgtttttg 


cagcagaaaa 


4320 


aagatccctt 


tcttgttacc 


gccaacgcgc 


4380 
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tccagccatg 


cacactgata 


ctcttcactc 


4500 


gctaacgtat 


ccacgccgta 


ttcggtgatg 


4560 


gccagaagtt 


ctttttccag 


taccttctct 


4620 


catccgtaat 


aacggttcag 


gcacagcaca 


4680 


gcgtcgcaga 


acattacatt 


gacgcaggtg 


4740 
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atcggacgcg tcgggtcgag tttacgcgtt 
acttgcggac gggtatccgg ttcgttggca 
ttgtcacgcg ctatcagctc tttaatcgcc 
actgcctctt cgctgtacag ttctttcggc 
5 gagaggttaa agccgacagc agcagtttca 
cagtcgagca tctcttcagc gtaagggtaa 
cagtccatta atgcgtggtc gtgcaccatc 
tccgcatctt catgacgacc aaagccagta 
tgttcgccct tcactgccac tgaccggatg 

10 gtctggcttt tggctgtgac gcacagttca 
tgcggattca ccacttgcaa agtcccgcta 
tccgcatcac gcagttcaac gctgacatca 
gcgtggttac agtcttgcgc gacatgcgtc 
ggcgtggtgt agagcattac gctgcgatgg 

15 taagactgct ttttcttgcc gttttcgtcg 
cagttcagtt cgttgttcac acaaacggtg 
tattagaaaa gttataaatt aaaatataca 
tatactacat tcttttattt tgaaaaaaat 
gataattatt atatatatat caaaggtaga 

20 ataacatacg gcgtgacatc ggcttcaaat 
tcctgattat tgacccacac tttgccgtaa 
cgctggcctg cccaaccttt cggtataaag 
taattacgaa tatctgcatc ggcgaactga 
cggctttctt gtaacgcgct ttcccaccaa 

25 agactgaatg cccacaggcc gtcgagtttt 
cgtaccatgg tcgatcgact ctagactagt 
atgaaatcga aattcagagt tttgatagtg 
tcgagtattt caagagatgg tacttgttga 
aatccagtat ctcgcggtgt atggagttat 

30 gtagagaatg atgtaattag ataagaatct 
gggtccatct tatcttgata agtggatggt 
gaagttccat atcaccatca tcatctgtat 
tgaaattttg gtagatttta tttttcatta 
tattgaacgt gacagattca caagaaagta 

35 taaaggtatc tctatttaat ttcttttgtt 
tttcatgctc atagtcatat acgttattct 
cgtaatacaa ttttataatt ttatcaaagt 
tacaaaacaa aaaaatcttc taattttttt 
aaaaatacaa atgaacccac atgtattatg 

40 gtactcaatt caccctatac tctttttttt 
ttctgtccat ttgaccaaat tcaattaatt 
atgttcacta tttctcgatt tgagggaacc 
atcactacaa atatgttata gtttatactt 
attaagtaaa catgattctt cttatgaagt 

45 gtgattgatg ttattggttg attgagtgat 
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gcttccgcca 


gtggcgaaat 


attcccgtgc 


4800 


atactccaca 


tcaccacgct 


tgggtggttt 


4860 


tgtaagtgcg 


cttgctgagt 


ttccccgttg 


4920 


ttgttgcccg 


cttcgaaacc 


aatgcctaaa 


4980 


tcaatcacca 


cgatgccatg 


ttcatctgcc 


5040 


tgcgaggtac 


ggtaggagtt 


ggccccaatc 


5100 


agcacgttat 


cgaatccttt 


gccacgtaag 


5160 


aagtagaacg 


gtttgtggtt 


aatcaggaac 


5220 


ccgacgcgaa 


gcgggtagat 


atcacactct 


5280 


tagagataac 


cttcacccgg 


ttgccagagg 


5340 


gtgccttgtc 


cagttgcaac 


cacctgttga 


5400 


ccattggcca 


ccacctgcca 


gtcaacagac 


5460 


accacggtga 


tatcgtccac 


ccaggtgttc 


5520 


attccggcat 


agttaaagaa 


atcatggaag 


5580 


gtaatcacca 


ttcccggcgg 


gatagtctgc 


5640 


atacctgcac 


atcaacaaat 


tttggtcata 


5700 


cacttataaa 


ctacagaaaa 


gcaattgcta 


5760 


atttgaaata 


ttatattact 


actaattaat 


5820 


agcagaaact 


tacgtacact 


tttcccggca 


5880 


ggcgtatagc 


cgccctgatg 


ctccatcact 


5940 


tgaqtgaccg 


catcgaaacg 


cagcacgata 


6000 


acttcgcgct 


gataccagac 


gttgcccgca 


6060 


tcgttaaaac 


tgcctggcac 


agcaattgcc 


6120 


cgctgaccaa 


ttccacagtt 


ttcgcgatcc 


6180 


ttgatttcac 


gggttggggt 


ttctacagga 


6240 


ggatccgata 


tcgcccgggc 


tcgactctag 


6300 


agagcaaaga 


gggacggact 


tatgaggatt 


6360 


tcggacggct 


acgatgatct 


cgatttggtt 


6420 


ggtagggtta 


atggtcaatt 


tcatctaacg 


6480 


tqagatactg 


gtttagattg 


gatgagtgta 


6540 


ttttagagac 


acagtgaata 


ttagccaatc 


6600 


aattttgttt 


ttttgqaaga- 


taataatgat 


6660 


tttaccttgt 


atgttgagtg 


gtcttcaaat 


6720 


gattttttat 


aaatgaaatt 


ttacttattt 


6780 


tatggttgtc 


tgtcagcatt 


tgacttgcag 


6840 


aggctttttt 


gaatatctta 


ttactttttt 


6900 


tatacaacta 


taactaaaat 


tagggttttc 


6960 


tgttgtagcc 


agtttactcg 


taagttacaa 


7020 


cgtttaacta 


ggattaccat 

J —J — ^ ^ 


gtactttcat 


7080 
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7140 


tctgtaattg 


cgatttaaaa 


ttaatattac 


7200 


cgagtttaaa 


tatgataaaa 


atgttgaccc 


7260 


aatagtggtg 


tttttgggga 


taattgatga 


7320 


tgattgagtg 


attattgtat 


gtaaacctat 


7380 


tattgtatta 


gtatgtaagc 


aaagatgatt 


7440 
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gttcttatga ggtaatttgt tactcattca 
tacgcaaaac aatagagaac ataaaagata 
tgatattgta gtatctatac atttatatat 
gatccggggg atcctctaga gtcgaggtac 
5 aacgactcag tactgcttgg taataattgt 
cgaaatcagc caattttaga caagtatcaa 
ccgcaatgtg ttattaagtt gtctaagcgt 
. accagccagc caacagctcc ccgaccggca 
aggtgatgtg tatttgagta aaacagcttg 
10 tgagtaaata aacaaatacg caaggggaac 
aaggcgggtc aggcaagacg accatcgcaa 
gggccgatgt tctgttagtc gattccgatc 
tgcgggaaga tcaaccgcta accgttgtcg 
tgaaggccat cggccggcgc gacttcgtag 
15 tggctgtgtc cgcgatcaag gcagccgact 
acgacatatg ggccaccgcc gacctggtgg 
atggaaggct acaagcggcc tttgtcgtgt 
gtgaggttgc cgaggcgctg gccgggtacg 
agcgcgtgag ctacccaggc actgccgccg 
20 agggcgacgc tgcccgcgag gtccaggcgc 
gagttaatga ggtaaagaga aaatgagcaa 
gagcgcacgc agcagcaagg ctgcaacgtt 
gcgggtcaac tttcagttgc cggcggagga 
■ ccaaggcaag accattaccg agctgctatc 
25 gagcaaatga ataaatgagt agatgaattt 
aagaacaacc aggcaccgac gccgtggaat 
caggcgtaag cggctgggtt gtctgccggc 
aggaatcggc gtgagcggtc gcaaaccatc 
tgatgacctg gtggagaagt tgaaggccgc 
30 agaagacgcc ccggtgaatc gtggcaaggg 
caaccgccgg cagccggtgc gccgtcgatt 
gattttttcg ttccgatgct ctatgacgtg 
gtggccgttt tccgtctgtc gaagcgtgac 
cttccagacg ggcacgtaga ggtttccgca 
35 tacgacctgg tactgatggc ggtttcccat 
gggaagggag acaagcccgg ccgcgtgttc 
tgccggcgag ccgatggcgg aaagcagaaa 
aacaccacgc acgttgccat gcagcgtacg 
gtatccgagg gtgaagcctt gattagccgc 
40 ccggagtaca tcgagatcga gctagctgat 
aacccggacg tgctgacggt tcaccccgat 
tttctctacc gcctggcacg ccgcgccgca 
acgatctacg aacgcagtgg cagcgccgga 
aagctgatcg ggtcaaatga cctgccggag 
45 ggcccgatcc tagtcatgcg ctaccgcaac 
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tccttttgca 


tatgagaaat 


tgtgttagcg 


7500 


tgtgtattta 


tttaaggtga 


cttttgttaa 


7560 


aacttgttga 


atttgagtat 


aagctatcag 


7620 


cgagctcgaa 


ttcactggcc 


gtcgttttac 


7680 


cattagattg 


tttttatgca 


tagatgcact 


7740 


acggatgtta 


attcagtaca 


ttaaagacgt 


7800 


caatttgttt 


acaccacaat 


atatcctgcc 


7860 


gctcggcaca 


aaatcaccac 


gcgtctaaaa 


7920 


cgtcatgcgg 


tcgctgcgta 


tatgatgcga 


7980 


gcatgaaggt 


tatcgctgta 


cttaaccaga 


8040 


cccatctagc 


ccgcgccctg 


caactcgccg 


8100 


cccagggcag 


tgcccgcgat 


tgggcggccg 


8160 


gcatcgaccg 


cccgacgatt 


gaccgcgacg 


8220 


tgatcgacgg 


agcgccccag 


gcggcggact 


8280 


tcgtgctgat 


tccggtgcag 


ccaagccctt 


8340 


agctggttaa 


gcagcgcatt 


gaggtcacgg 


8400 


cgcgggcgat 


caaaggcacg 


cgcatcggcg 


8460 


agctgcccat 


tcttgagtcc 


cgtatcacgc 


8520 


ccggcacaac 


cgttcttgaa 


tcagaacccg 


8580 


tggccgctga 


aattaaatca 


aaactcattt 


8640 


aagcacaaac 


acgctaagtg 


ccggccgtcc 


8700 


QQCcagcctg 


gcagacacgc 


cagccatgaa 


8760 


tcacaccaag 


ctgaagatgt 


acgcggtacg 


8820 


tgaatacatc 


gcgcagctac 


cagagtaaat 


8880 


tagcggctaa 


aggaggcggc 


atggaaaatc 


8940 


gccccatgtg 


tggaggaacg 


ggcggttggc 


9000 


cctgcaatgg 


cactggaacc 


cccaagcccg 


9060 


cggcccggta 


caaatcggcg 


cggcgctggg 


9120 


gcaggccgcc 


cagcggcaac 


gcatcgaggc 


9180 


qccqctgatc 


gaatccgcaa 


agaatcccgg 


9240 


aggaagccgc 


ccaagggcga 


cgagcaacca 


9300 


ggcacccgcg 


atagtcgcag 


catcatggac 


9360 


cgacgagctg 


gcgaggtgat 


ccgctacgag 


9420 


gggccggccg 


gcatggccag 


tgtgtgggat 


9480 


ctaaccgaat 


ccatgaaccg 


ataccgggaa 


9540 


cgtccacacg 


ttgcggacgt 


actcaagttc 


9600 


gacgacctgg 


tagaaacctg 


cattcggtta 


9660 


aagaaggcca 


agaacggccg 


cctggtgacg 


9720 


tacaagatcg 


taaagagcga 
«^ 


aaccgggcgg 


9780 
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aaaaaacaaQ 


9840 


tactttttga 


tcgatcccgg 


catcggccgt 


9900 


ggcaaggcag 


aagccagatg 


gttgttcaag 


9960 


gagttcaaga 


agttctgttt 


caccgtgcgc 


10020 


tacgatttga 


aggaggaggc 


ggggcaggct 


10080 


ctgatcgagg 


gcgaagcatc 


cgccggttcc 


10140 
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taatgtacgg agcagatgct agggcaaatt gccctagcag gggaaaaagg tcgaaaaggt 10200 

ctctttcctg tggatagcac gtacattggg aacccaaagc cgtacattgg gaaccggaac 10260 

ccgtacattg ggaacccaaa gccgtacatt gggaaccggt cacacatgta agtgactgat 10320 

ataaaagaga aaaaaggcga tttttccgcc taaaactctt taaaacttat taaaactctt 10380 

5 aaaacccgcc tggcctgtgc ataactgtct ggccagcgca cagccgaaga gctgcaaaaa 10440 

gcgcctaccc ttcggtcgct gcgctcccta cgccccgccg cttcgcgtcg gcctatcgcg 10500 

gcctatgcgg tgtgaaatac cgcacagatg cgtaaggaga aaataccgca tcaggcgctc 10560 

ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc 1062 0 

agctcactca aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa 10680 

10 catgtgagca aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt 10740 

tttccatagg ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg 10800 

gcgaaacccg acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg 10860 

ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag 10920 

cgtggcgctt tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc 10980 

15 caagctgggc tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa 11040 

ctatcgtctt gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg 11100 

taacaggatt agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc 11160 

taactacggc tacactagaa ggacagtatt tggtatctgc gctctgctga agccagttac 11220 

cttcggaaaa agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg 11280 

20 tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt 11340 

gatcttttct acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt 11400 

catgcatgat atatctccca atttgtgtag ggcttattat gcacgcttaa aaataataaa 11460 

agcagacttg acctgatagt ttggctgtga gcaattatgt gcttagtgca tctaacgctt 11520 

gagttaagcc gcgccgcgaa gcggcgtcgg cttgaacgaa tttctagcta gacattattt 11580 

25 gccgactacc ttggtgatct cgcctttcac gtagtggaca aattcttcca actgatctgc 11640 

gcgcgaggcc aagcgatctt cttcttgtcc aagataagcc tgtctagctt caagtatgac 11700 

gggctgatac tgggccggca ggcgctccat tgcccagtcg gcagcgacat ccttcggcgc 11760 

gattttgccg gttactgcgc tgtaccaaat gcgggacaac gtaagcacta catttcgctc 11820 

atcgccagcc cagtcgggcg gcgagttcca tagcgttaag gtttcattta gcgcctcaaa 11880 

30 tagatcctgt tcaggaaccg gatcaaagag ttcctccgcc gctggaccta ccaaggcaac 11940 

gctatgttct cttgcttttg tcagcaagat agccagatca atgtcgatcg tggctggctc 12000 

gaagatacct gcaagaatgt cattgcgctg ccattctcca aattgcagtt cgcgcttagc 12060 

tggataacgc cacggaatga tgtcgtcgtg cacaacaatg gtgacttcta cagcgcggag 1212 0 

aatctcgctc tctccagggg aagccgaagt ttccaaaagg tcgttgatca aagctcgccg 12180 

35 cgttgtttca tcaagcctta cggtcaccgt aaccagcaaa tcaatatcac tgtgtggctt 12240 

caggccgcca tccactgcgg agccgtacaa atgtacggcc agcaacgtcg gttcgagatg 12300 

gcgctcgatg acgccaacta cctctgatag ttgagtcgat acttcggcga tcaccgcttc 12 3 60 

ccccatgatg tttaactttg ttttagggcg actgccctgc tgcgtaacat cgttgctgct 12420 

ccataacatc aaacatcgac ccacggcgta acgcgcttgc tgcttggatg cccgaggcat 124 80 

40 agactgtacc ccaaaaaaac agtcataaca agccatgaaa accgccactg cgttccatg 12539 



